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NITROREDUCTASE ENZYMES 



The present invention relates to polypeptides and proteins having 
nitroreductase activity, to DNA and genes encoding these nitroreductases 
and to methods of obtaining such enzymes, DNA and genes. 

A number of cancer therapies are based upon or exploit the conversion of 
a non-toxic prodrug into a toxic derivative. 

One example concerns the monofunctional alkylating agent CB1 954, which 
exhibits extreme toxicity towards the Walker 256 rat carcinoma as a result 
of the presence of a DT-diaphorase enzyme (DTD) which reduces the 4- 
nitro group of CB1954 to give a highly cytotoxic 4-hydroxylamine (4HX) 
derivative. CB1 954 does not have the same effect on human carcinomas 
because human cells lack this enzyme but would be effective against 
human tumours if an enzyme such as DTD were externally supplied, e.g. 
in a Directed-Enzyme Prodrug Therapy (DEPT). The rat DTD, however, has 
a relatively poor specific activity for CB1954. The E.coii B nitroreductase 
enzyme (NfnB) was isolated as a more effective alternative and is the 
subject of EP-A-0540263. It exhibits a higher specific activity for CB1 954, 
compared with the rat enzyme and is, therefore, currently the preferred 
enzyme in anti-cancer DEPT strategies. 

Whilst the known E.coii enzyme receives widespread attention from cancer 
biologists seeking to develop gene based DEPT strategies, it has a number 
of drawbacks. These mostly relate to its activity against the preferred 
prodrug, CB1954 - it has a relatively high K m and low K cat , and converts 
CB1 954 into equimolar amounts of a relatively innocuous 2-hydroxylamino 
derivative (2HX) in addition to the highly cytotoxic 4-hydroxylamino species 
(4HX). 

In relation to this specific prodrug, it is hence desired to provide an 



r 



WO 00/47725 PCT/GB00/00431 

- 2 - 

alternative to the known E.coli enzyme. 



Additionally, and more generally, analogues of CB1 954 and prodrugs other 
than CB1954 are known and further such precursors of potential toxic 
5 agents may become the focus of future therapies. In relation to all of these 

it . is desired to provide further enzymes capable of use in converting 
prodrugs into drugs, e.g. for clinical uses. 

It is an object of the present invention to provide nitroreductase enzymes, 
1 o in particular nitroreductase enzymes for converting CB1 954 and analogues 

thereof into drugs. It is a further object of the present invention to provide 
DNA and genes encoding nitroreductases, which DNA and genes in 
particular are incorporated into pharmaceutical compositions for prodrug 
therapies. 

15 

The present invention is based upon the discovery, purification, gene 
sequencing and/or expression of nitroreductases in bacteria and other 
microorganisms with hitherto unknown properties in converting prodrugs 
such as CB1954 into toxic derivatives. These nitroreductases posses 

20 properties which alone or in combination offer potential improvements 

compared with the known enzymes in this technology. The nitroreductases 
of the invention may be divided into different families based upon such 
characteristics as activity, product spectrum and/or amino acid sequence, 
and each given nitroreductase may fall into more than one of these 

25 families. 



The present invention provides, in a first aspect, a nitroreductase enzyme, 
characterised in that it preferentially reduces CB1 954 to a product that is 
a cytotoxic 4-hydroxylamine (4HX) derivative. 

30 

The enzymes of this aspect of the present invention confer the advantage 
that the product they generate from CB1 954 contains a greater proportion 
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of the cytotoxic 4HX derivative then the non-cytotoxic 2-hydroxylamino 
derivative. In preferred embodiments of the invention, the product is 
substantially entirely the cytotoxic derivative. The enzymes may hence be 
more efficient that those of the art as the enzymes of the invention 
5 produce more cytotoxic product for a given amount of pro-drug. 

The present invention further provides, in a second aspect, a nitroreductase 
enzyme, characterised in that it reduces a prodrug to a toxic derivative with 
a K m of less 700 micromolar, wherein the prodrug is selected from CB1 954 

10 and analogues thereof or other bioreductive drugs (Denny et al, B.J. 

Cancer, 1 996, 74, pp S32-S38). The enzymes of the second aspect of the 
invention offer an advantage over the known E. coli- derived enzyme in that 
they have a lower K m (K m of E.coli NfnB for CB1954 is around 862 
micromolar) and thus have a higher affinity for substrate. Twenty nitrogen 

1 5 mustard analogues of CB1 954 are described by Friedlos et al (J Med Chem, 

1997, 40, 1270-1275). 

More preferably, the K m of the enzymes of the second aspect of the 
invention is less than 300 micromolar. 



20 



25 



In a third aspect, the present invention provides a nitroreductase enzyme 
characterised in that it reduces a prodrug to a toxic derivative with a K ca: 
of at least 8, wherein the prodrug is selected from CB1954 and analogues 
thereof. 



The enzymes of this aspect of the invention offer an improvement over that 
of the art, specifically the E.coli enzyme, in that they have an improved K caT 
- i.e a higher value than for E.coli NfnB indicating a higher turnover of 
substrate by the enzyme. In preferred embodiments of this aspect of the 
30 invention, the K cat of the enzymes is at least 10. 



In a fourth aspect of the invention, there is provided a nitroreductase 
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enzyme characterised in that it reduces CB1954 to a toxic derivative, it 
reduces SN23862 to a toxic derivative, it can use NADH and/or NADPH as 
electron donor and in that it shares no more than 50% sequence identity 
with the £.co//NfnB sequence. Preferably; the sequence identity is about 
5 25% or less, this sequence identity being measured using the MEGALIGN 

(registered trade mark) software. 

It has already been discussed how the known E.coli nitroreductase is well 
characterised and is fully sequenced. The nitroreductases of the fourth 
1 0 aspect thus represent a class of enzymes having nitroreductase activity, or 

being nitroreductase-like, which nevertheless are so different in amino acid 
sequence from the E.coli enzyme as to represent a separate family of 
nitroreductases. 



15 This aspect of the invention thus advantageously provides a further class 

of nitroreductase enzymes for use e.g. in prodrug therapies. 



The invention still further provides, in a fifth aspect, a nitroreductase 
enzyme characterised in that it reduces CB1 954 or an analogue thereof to 
20 a toxic derivative, in that it shares at least 50% sequence identity with the 

rat DTD sequence and in that it does not contain a domain that is the same 
as or corresponds to amino acids 51 to 82 of the rat DTD sequence. 



Sequence identity is suitably measured in the same way as described above - 
25 in relation to the fourth aspect. 



To determine whether a given nitroreductase contains a domain that is the 
same as or corresponds to amino acids 51 to 82 of the rat DTD sequence, 
the amino acid sequence of the given nitroreductase and of the rat DTD 
30 sequence are aligned using a conventional sequence alignment program, 

such as MEGALIGN (registered trade mark) made by DNASTAR, Inc. 
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If the alignment program indicates that there are no amino acids in the 
given sequence that, following the algorhythm of the program, are held to 
correspond to those at positions 51-82 of the rat DTD sequence then it is 
concluded that the rat domain is lacking from the given sequence. 

5 

This aspect of the invention thus provides a further class of nitroreductase 
enzymes for conversion e.g. of prodrugs into drugs. A nitroreductase in 
this class may also be obtained by deleting amino acid residues that 
correspond to residues 51-82 of the rat DTD from a known mammalian 
10 enzyme. 

The nitroreductases of the invention may also be NADPH dependant. This 
property further distinguishes some enzymes of the invention from the 
known E.coii enzyme and the rat DTD. 

15 

It has been found that enzymes having one or more of the properties 
described may be obtained from bacteria of the family Bacillus, in particular 
a Bacillus selected from B. amyloliquefaciens, B. subtilis, B. pumilis, B. 
lautus, B. thermoflavus, B. licheniformis and B. alkophilus. This finding is 
20 of surprise in that at least three nitroreductase enzymes have been found 

in some species, in particular B. subtilis, B. lautus and B. pumilis, and as 
nitroreductases having the advantageous properties of the invention have 
not hitherto been identified in these bacteria, the currently used 
nitroreductase being obtained from E.coii. 

25 

In specific embodiments of the invention described in more detail below, 
a nitroreductase has a sequence selected from SEQ ID Nos 2, 4, 6, 8, 10, 
12, 14, 16, 17, 18, 19, 20, 21, 23, 25, 27 and 29. 

30 It has further been found that nitroreductases according to the invention 

may fall into more than one aspects of the invention. It is hence preferred 
that a nitroreductase of the invention possesses the properties of at least 
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two aspects of the invention, and more preferably at least three aspects of 
the invention. 

A specific embodiment of the invention is a nitroreductase of SEQ ID NO:2 
obtained from B. amyloliauefaciens this enzyme converts CD194 ,nto 
substantially only the cytotoxic derivative, hence falling into the first aspect 
of the invention, but also has a K m that is improved compared to the E.col, 
enzyme, hence falling also into the second aspect of the invent.on. 

A further specific embodiment of the invention is a nitroreductase from 
Bsubtifis SEQ ID NO:9. This enzyme has a better K cat than the E.col, 
enzyme, its K cat being about 15 compared with about 6 for the E.coli 
enzyme, and hence fails into the third aspect of the invention. Additionally, 
this enzyme falls into the fourth aspect of the invention in that it reduces 
both CB1954 and SN23862 but shares less than 30% sequence identity 
with the E.coli sequence. Another B.subtilis enzyme, SEQ ID NO:11 is 
similarly in both the third and fourth aspects of the invention, having a K c3 , 
of about 15. 

From the examples set out below it will be apparent how the further 
specific embodiments of the invention fall into at least two and even three 
aspects of the invention. 

The enzymes of the invention are of use in enzyme directed prodrug 
therapy. Accordingly, it is preferred that they are provided in purified form. 

A sixth aspect of the invention provides a pharmaceutical composition 
comprising a nitroreductase enzyme according to any of the first to fifth 
aspects of the invention in combination with a pharmaceutical^ acceptable 
30 carrier. 

As mentioned above, the nitroreductase of the invention are of use in 
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therapies such as directed-enzyme prodrug therapies. In these therapies, 
it is required to deliver the nitroreductase to the target site. This delivery 
can be achieved by delivering the enzyme itself or by delivering a DNA or 
gene coding for the enzyme. 

5 

In an example of the enzyme of the invention in use, a pharmaceutical 
composition is designed for a directed-enzyme prodrug therapy, and 
comprises a pharmaceutical acceptable carrier and a compound for 
converting a prodrug into a drug, wherein a compound is composed of at 
10 least a nitroreductase according to any of the first to fifth aspects of the 

invention conjugated to a targeting moiety. 



The targeting moiety can suitably comprise an antibody specific for a target 
cell. Alternatively, the targeting moiety is a moiety preferentially 
15 accumulated by or taken up by a target cell. 

A further example of delivery of the enzyme of the invention is achieved in 
a gene therapy-based approach for targeting cancer cells, as described in 
WO 95/12678. As described by Knox R.J. et al, the basis of this further 

20 prodrug therapy is delivery of a drug susceptibility gene into target, usually 

tumour or cancer, cells. The gene encodes a nitroreductase that catalyses 
the conversion of a prodrug into a cytotoxic derivative. The nitroreductase 
itself is not toxic and cytotoxicity used to treat the tumour cells arises after 
administration of a prodrug which is converted into the cytotoxic form. A 

25 bystander effect may be observed as cytotoxic drug may diffuse into 

neighbouring cells. 



Thus, in this gene-based therapy, the nitroreductase is expressed inside a 
cell, in contrast to other delivery systems in which, for example, the 
30 enzyme itself is delivered accompanied by a targeting moiety. 



Targeting of gene-based therapies may be achieved by providing a virus or 
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liposome with altered surface components so that the delivery vehicle is 
recognised by target cells. Typically, transcriptional elements are chosen 
so that the gene coding for the nitroreductase enzyme will be expressed in 
the target cells, and preferably substantially only in the target cells. A 
5 number of viral-based vectors are suitable for this delivery. Retro-viral 

based vectors typically infect replicating cells. Adenoviral vectors and 
lentiviral-vectors are also believed to be suitable. 



This delivery technology has been demonstrated by Bridgewater et al (Eur 
10 J Cancer 31a, 236-2370, 1995). A recombinant retrovirus encoding a 

nitroreductase was used to infect mammalian cells, it being observed that 
infected cells expressing the nitroreductase were killed by application of 
CB1954. 



1 5 Accordingly, a further aspect of the invention provides the use of a DNA 

sequence coding for a nitroreductase of the invention in manufacture of a 
medicament for prodrug therapy. 



The medicament may take the form of a viral vector, comprising a DNA 
20 encoding the nitroreductase of the invention operatively coupled to a 

promoter for expression of the DNA. The medicament may take the form 
of a mini-gene comprising a DNA operatively linked to a promoter for 
expression of the DNA, the mini-gene being suitable for inclusion or 
incorporation into a targeting vehicle such as a microparticle. 

25 

Thus, an embodiment of the invention provides a viral vector comprising 
a nucleotide sequence encoding a nitroreductase according to any of 
aspects 1 to 5 of the invention, which nitroreductase converts a prodrug 
into a cytotoxic drug, and also a kit comprising the viral vector and the 
30 prodrug, and also a method of treatment of tumours which comprises 

administering an effective amount of the viral vector together with an 
effective amount of the prodrug. 
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The preparation and administration of these viral vectors may be 
substantially as described in WO 95/12678, the contents of which is 
incorporated herein by reference. The present invention relates to 
providing nitroreductase enzymes and genes and DNA coding therefore. 
5 The uses of those enzymes and genes may be as set out in WO 95/1 2678. 

A nitroreductase can also be delivered by putting a gene of the invention 
into a bacteria that selectively colonises tumours, such as a clostridial 
(Lemmon et al, Gene Therapy, 1997, 4, 791-796) or Salmonella species. 

A further aspect of the invention provides an isolated DNA encoding a 
nitroreductase according to any of the first to fifth aspects of the invention. 
The DNAs of this further aspect of the invention, and also the DNAs 
incorporated into vectors of the invention, preferably comprise a sequence 
which is selected from SEQ ID NOs: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 22, 24, 26 
or 28, together with fragments, derivatives and analogs thereof retaining 
nitroreductase activity according to one of the first to fifth aspects of the 
invention. The fragments, derivatives and analogs are suitably selected 
from sequences which retain at least 70% identity with the specific 
embodiments of the invention, or preferably at least 90% identity and most 
preferably at least 95% identity. 

The enzymes of the invention can also be obtained by purification from cell 
extracts and may also be obtained by recombinant expression of DNA. A 
25 still further aspect of the invention lies in a method of preparing a 

nitroreductase enzyme, comprising expressing a gene in a bacterial cell, 
wherein the gene codes for a nitroreductase enzyme of the invention. 

In an example of the invention described below in more detail, the gene 
30 expressed is a Bacillus gene or is a gene obtained by substitution, deletion 

and/or addition of nucleotides in or to a Bacillus gene. 
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The invention also provides the use of a nitroreductase according to any of 
the aspects of the invention in manufacture of a medicament for anti- 
tumour therapy, and the use of a compound comprising a nitroreductase 
according to any aspect of the invention conjugated to a targeting moiety 
5 in manufacture of a medicament for anti-tumour therapy. 



The invention is now illustrated by the following specific examples and in 
the accompanying sequence listing in which: 

SEQ ID NO: 2 is a nitroreductase from B.amyloliquefaciens (coded 
10 for by SEQ ID NO: 1) and designated "Bam YrwO"; 

SEQ ID NO: 4 is a nitroreductase from B.subtifis (coded for by SEQ 
ID NO: 3) and designated n Bs YwrO"; 

SEQ ID NO: 6 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 5) and designated "YrkL"; 
15 SEQ ID NO: 8 is a nitroreductase from B.subtilis (coded for by SEQ 

ID NO: 7) and designated "YdeQ"; 

SEQ ID NO: 10 is a nitroreductase from B.subtilis (coded for by SEQ 
ID NO: 9) and designated "Ydg!"; 

SEQ ID NO: 1 2 is a nitroreductase from B.subtilis (coded for by SEQ 
20 ID NO: 1 1) and designated "YodC M ; 

SEQ ID NO: 14 is a nitroreductase from E.coli (coded for by SEQ ID 
NO: 13) and designated "YabF" 

SEQ ID NO: 1 6 is a nitroreductase from E.coli (coded for by SEQ ID 
NO: 15) and designated "YheR"; 
25 SEQ ID NO: 1 7 is a nitroreductase from H. influenzae; 

SEQ ID NO: 18 is a nitroreductase from T.aquaticus; 
SEQ ID NO: 1 9 is a nitroreductase from Synechocystis sp PCC 6803; 
SEQ ID NO: 20 is a nitroreductase from A.fufgidus; 
SEQ ID NO: 21 is a nitroreductase from A.fufgidus. 
30 SEQ ID NO: 23 is a nitroreductase from Campylobacter jejuni (coded 

for by SEQ ID NO: 22); 

SEQ ID NO: 25 is a nitroreductase from Porphyromonas gingivaiis 
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(coded for by SEQ ID NO: 24); 

SEQ ID NO: 27 is a nitroreductase from Yersinia pestis (coded for by 

SEQ ID NO: 26); and 

SEQ ID NO: 29 is a nitroreductase from Helicobacter pylori (coded 

5 for by SEQ ID NO: 28). 

The invention is also illustrated by reference to the accompanying Tables 
1-4 and Figures 1 and 2, in which Figs 1 and 2 show sequence 
comparisons as set out in more detail in Example 8. 
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Example 1 



A Nitroreductase Enzyme/Gene from Bacillus amyloiiquefaciens 

Briefly, extracts of Bacillus amyloiiquefaciens were shown to possess 
15 nitroreductase activity. To purify this activity, crude cell extracts were 

subjected to ammonium sulphate, fractionation and anion exchange 
chromatography. The purified material was subject to N-terminal amino 
acid sequence analysis and the information obtained used to cloned the 
gene via a PCR-based strategy. Following determination of its nucleotide 
20 sequence the gene was overexpressed in E. coli and the resultant 

recombinant protein purified and characterised see table 1 . 

This analysis showed that the enzyme had properties which were distinct 
from that of E.coli NfnB. Thus the protein had a more favourable K m for 
25 CB1 954 (1.5-fold lower than the E. coli B NfnB) and furthermore converted 

CB1954 into the 4HX form alone. It also differed from the E. coli B NfnB 
in that the enzyme showed no activity against the prodrug SN23862. 

The isolated enzyme/gene represents a significant improvement over the 
30 E.coli NfnB enzyme with respect to its activity against the prodrug CB1954 

ie., it produces only the 4HX derivative and has an improved K m for 
CB1954. 
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A comparison of the amino acid sequence of the isolated enzyme revealed 
that it shared a very low level of homology to the rat DTD (c. 25%), but 
exhibited high homology (70% sequence identity) with the predicted 
product of a gene that has been discovered in the Bacillus subtilis genome 
sequencing project, designated ywrO. On this basis, we have designated 
the cloned Bacillus amyloliquefaciens gene ywrO, and its encoded enzyme 
YwrO. 



YwrO BAM is a tetrameric flavoprotein (monomeric molecular mass 
10 approximately 22.5 kDa by SDS-PAGE, native molecular mass 

approximately 90 kDa by gel filtration). Although it shares sequence 
homology with rat DTD it differs in its enzymic properties in that it can use 
only NADPH as cof actor (K m 40 jjM). In common with DTD it can reduce 
CB1954 but not SN23862, reduction of CB1954 resulting in formation of 
15 the 4HX product only <K m 617/yM, k cat 8.2). It shows a high affinity for the 

quinone menadione (K m 3AjjM) and has azoreductase and flavin reductase 
activity (K m for FMN 53 >uM, K m for FAD 209 a/M). 

In more detail, N-terminal amino acid sequencing of the purified Bacillus 
20 amyloliquefaciens nitroreductase enzyme resulted in the following 

sequence^et-Lys-Val-Leu-Val-Leu-Ala-Val-His-Pro-Asp-Met-Glu-Asn-Ser- 
Ala-VakAsn. When this sequence was used to search available protein 
databases strong homology was noted with the predicted amino acid 
sequence of a hypothetical protein, YrkL, identified in the Bacillus subtilis. 
25 genome sequencing project. Significant homology was also evident with 

two proteins, YabF and YheR, identified during the course of the 
determination of the Escherichia coli genome. These three hypothetical 
proteins shared weak homology with a number of mammalian quinone 
reductases and NAD(P)H-oxidoreductases, such as the rat DTD. 

30 

In view of this observation, a strategy was formulated whereby sequence 
homology between the identified bacterial proteins, together with the 
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determined N-terminal amino acid sequence of the discovered Bacillus 
amyloliquefaciens enzyme, was used to amplify a region of the desired 
encoding gene from the Bacillus amyfoliquefaciens genome. The one primer 
utilised in PCR was a degenerate oligonucleotide sequence which 
5 corresponded to a DNA sequence capable of coding for the N-terminal 

octa-peptide Val-His-Pro-Asp-Met-Glu-Asn. It was composed of the 
following nucleotides, 5'-GTNCAYCCNGATATGGARAA-3\ where Y 
indicates the presence of a T or C, R indicates the presence of A or G, and 
N indicates the presence of either T, C, G or A. The second primer was 
10 based on the hypothetical sequence His-Gly-Trp-Ala-Tyr-Gly which was 

found to be entirely conserved between the hypothetical bacterial proteins 
YrkL {Bacillus subtilis) and YabF [E.coli), and partially conserved in YheR 
(E.coli). The degenerate oligonucleotide mixture synthesised corresponded 
to the antisense DNA coding strand, viz., 5'-CCRTANGCCCANCCRTG-3' . 

15 

E.coli YheR (90-95) Arg Gly Phe Ala Ser Gly 

E.coli YabF (84-89) His Gly Trp Ala Tyr Gly 

B. subtilis YrkL (85-90) His Gly Trp Ala Tyr Gly 



20 The two primers were employed in PCR using chromosomal DNA isolated 

from Bacillus amyloliquefaciens and an amplified DNA fragment of the 
expected size (approximately 230 bp) obtained. This was cloned into 
plasmid pCR2. 1 TOPO (Invitrogen) and its nucleotide sequence determined. 
Translation of the sequence obtained demonstrated the presence of an 

25 open reading frame which encoded a polypeptide which shared 66% 

sequence similarity with YrkL. 

To obtain the entire structural gene, an approach was employed based on 
inverse PCR. In essence, B. amyloliquefaciens DNA was cleaved with the 
30 restriction enzyme Styl and the fragments generated circularised through 

their subsequent incubation with DNA ligase. The ligated DNA was then 
used as the template for a PCR employing two divergent primers based on 
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the sequenced 220 bp fragment. These were BamNTFM 1 (5'- 
GCTTATTGACCGCTGAG-3') and BamNTRI 4 <5'-GTACAGTGCGCCTCCGC- 
3'). A 2.9 kb fragment was generated, cloned into pCR2.1TOPO 
(Invitrogen) and the sequence of the insert determined. This allowed the 
5 identification of the nucleotide sequence of the remaining parts of the B. 

amyloiiquefaciens gene. Using this information, a contiguous copy of the 
entire structural gene was amplified from the B. amyloiiquefaciens 
chromosome using primers which encompassed the translational start 
codon (5'-GGTGTGATACATATGAAAGTATTG-3') and resided 3' to the 

10 translational stop codon (5'-CGGGGATTCGAATTCTTTCTCAGG-3'>. The 

primer at the 5'-end of the gene was designed such the sequence 
immediately 5' to the ATG start codon became CAT. This change created 
an Nde\ restriction site (CATATG), thereby allowing the cloning of the gene 
into the equivalent site of the expression vector pMTL1015. This 

15 manipulation facilitated the subsequent overexpression of the gene, as 

insertion of the gene at this point positions the start codon at an optimum 
distance from the vector borne ribosome binding site. 



The strategy employed to clone the BM YwrO gene could be similarly 
20 employed to clone further genes encoding novel nitroreductases. This 

would involve purifying the desired enzyme activity from a cell lysate, and 
then determining the N-terminal sequence. The data obtained could then 
be used to design an oligonucleotide primer corresponding to the sense 
strand of the DNA encoding part or all of the determined amino acid 
25 sequence. This primer could then be used, in conjunction with a second 

primer, to amplify part of the gene encoding the nitroreductase from the 
chromosome of the bacterial host using PCR. The second primer would 
correspond to the antisense strand of an internal portion of the targeted 
gene. Its design would be based on regions of homology which are 
30 conserved amongst the type of nitroreductase family that is sought. Thus, 

in the case of the DTD-like family, the oligonucleotide would, for example 
be based on the conserved motif His-Gly-Trp-Ala-Tyr-Gly (ie., amino acid 
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residues 85-90 in the BS YrkL protein). In the case of the NfnB-like family, 
the oligonucieotdie could be based on the motif Glu-Arg-Tyr-Val-Pro-Val- 
Met (ie., amino acid residues 170-176 in the BS YodC protein). 

Such amplified fragments could then be cloned and sequenced, and new 
primers designed based on this sequence to isolate the flanking regions of 
the gene by PCR. Once these have been cloned and sequenced, the entire, 
contiguous structural gene may be amplified using primers which extend 
beyond the 5' and 3' end of the translational start and stop codons. 

Cloning of genes encoding novel nitroreductases may also be achieved 
without recourse to N-terminal sequencing of the enzyme, or even its 
purification. This would involve basing the sequence of both of the 
oligonucleotides used in the initial PCR reaction on amino acid sequence 
motifs conserved amongst the two identified nitroreductase families. 
Thus, in the case of the NfnB-like family, a sense primer (eg., 5'- 
ATTTCTAAAGAAGAGCTGACGGAA-3') based on the motif lle-Ser-Lys-Glu- 
Glu-Leul-Thr-Glu (ie., amino acid residues 1 3 to 20 of BS YodC) could be 
employed with the an antisense primer (eg., 5'- 
CATTACCGGTACATAGCGTTC-3') based on the sequence motif Glu-Arg- 
Tyr-Val-Pro-Val-Met (ie., amino acid residues 170 to 176). In the case of 
the DTD-family a sense primer (eg., 5'-CATCCGGATATGGAAAAT-3') 
based on the motif His-Pro-Asp-Met-Glu-Asn (ie., amino acid residues to 9 
to 14 of BM YwrO) could be employed with the an antisense primer (eg., 
5'-TCCATATGCCCATCCATA-3') based on the sequence motif Tyr-Gly- 
Trp-Ala-Tyr-Gly (ie., amino acid residues 85 to 90). Once amplified, the 
rest of the gene could be isolated using the same procedure as outlined 
above. 

Example 2 



Bacillus subtilis Nitroreductases 
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As indicated above in Example 1, comparative analysis of the B. subtilis 
genome sequence with the amino acid sequence of the isolated B. 
amyloliquefaciens enzyme demonstrated the existence of an enzyme 
(YwrO) which shared 70% sequence identity. Unexpectedly, B. subtilis was 
found to possess two homologues, YrkL and YdeQ, which share 54% and 
51% sequence homology, respectively, with the B. amyloliquefaciens 
enzyme. All three enzymes share no homology with the E.coli B NfnB. 
They do, however, exhibit weak similarity (c. 25%) to the rat DT- 
Diaphorase (DTD). Whilst these proteins share a low level of sequence 
similarity to DTD, and other mammalian equivalents, they are 
characteristically smaller. This is because of the absence of an extensive 
internal protein domain at the N-terminus of the protein. Thus, the 
functional equivalent domain of the rat DTD between amino acid residues 
51 to 82, are missing from the BM YwrO protein. In addition, the rat DTD 
has an extra COOH-terminal domain. These bacterial enzymes are thus 
distinct from their mammalian equivalents. 

A further analysis of the B, subtilis genome, demonstrated that two 
homologues of the E.coli NfnB gene were present. Their encoded proteins 
(Ydgl and YodC) share a barely detectable level of sequence conservation 
with EC NfnB, of around 20% sequence identity. 

Bacillus subtilis was thus found to carry at least 5 different enzymes with 
nitroreductase activity. These are split into two families, thus- 
DTD-like - 3 members:- YwrO, YrkL, YdeQ 
NfnB-like - 2 members:- Ydgl, YodC 

Example 3 



Recombinant Production of Nitroreductases from Bacillus subtifis 

The DNA encoding all 5 B. subtilis nitroreductase enzymes were cloned 
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from genomic DNA using PCR and the resultant genes, following 
authentication by nucleotide sequencing, subcloned into a propriety 
CAMR expression vector (pMTL1015). The expression clones generated 
have been used to overproduce each of the 5 proteins and the enzymic 
activity of each assessed in crude lysates. This analysis has demonstrated 
that whilst the B.subtilis YwrO shares similar properties to the B. 
amyloliquefaciens homologue (ie., converts CB1 954 to the 4HX derivative 
alone, but is inactive against SN23862), YrkL and YdeQ have no activity 
against either of the two prodrugs tested (CB1 954 or SN23862) but they 
may be active against other prodrugs. 

Despite the extremely limited sequence similarity to EC NfnB, Ydgl and 
YodC are active against both CB1954 and SN23862. They do, however, 
produce both the 2HX and 4HX derivatives of CB1954. Their 
characterisation has shown that they turn over CB1 954 at higher rates than 
EC NfnB (YodC k cat 58, Ydgl k cat 30.3 cf 6 for NfnB). Both show a high 
affinity for menadione and flavins, but they differ in that whereas Ydgl uses 
both NADH and NADPH, YodC shows a preference for the latter. The 
native molecular mass of YodC (approximately 90kDa) indicates that it is 
tetrameric (molecular mass estimated from amino acid sequence and by 
SDS-PAGE being approximately 22 kDa) whereas Ydgl appears to be a 
dimer in the native state (molecular mass by gel filtration approximately 49 
kDa). 

These finding are further illustrated in Table 2. 
Example 4 

Bacillus lautus & Bacillus pumifis nitroreductases 

From 103 soil sample isolates tested, two strains (Bacillus pumilis CP044 
and Bacillus lautus CP060) had been previously chosen as possessing 
extracts which showed the most rapid reduction of both CB1954 and 
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SN23862. Purification experiments demonstrated that the activity in both 
extracts was distributed across three distinct peaks. The presence of more 
than one enzyme activity is consistent with our discovery of multiple forms 
of proteins in Bacillus able to turnover prodrugs. Eventual purification of the 
three enzymes of B. pumifis CP044 revealed that no one candidate 
exhibited properties which were an improvement on the E.coll NfnB 
enzyme. In contrast, the proteins in peak 1 and peak 3 of the B.lautus 
CP060 were determined to offer advantage over NfnB. 

Thus, whilst the enzyme in peak 1 did not produce the required 4HX 
derivative of CB1954, it exhibited a 4-fold lower Km with the prodrug 
SN23862. The enzyme of peak 3 was, however, deemed to be of greatest 
value as it converted CB1 954 solely into the 4HX derivative and had a Km 
approximately 4-fold lower than NfnB. Furthermore, it also had activity 
against SN23862. In this respect it shares the properties of both the 
Bacillus DTD-like family (ie., it produces only the 4HX derivative) and the 
NfnB-like family (ie., it is active against SN23862) - these findings are 
illustrated in Table 3. 

Example 5 

N-terminal Sequencing of B. lautus Nitroreductase 

Electrophoretic separation of the peak 3 demonstrated that 4 protein bands 
were present which could account for the observed prodrug activity. All 
four were subjected to N-terminal amino acid sequencing and the activity 
localised to the fourth protein band from which the nitroreductase may be 
purified. 

Example 6 



Detection of Nitroreductase Activity in Thermophile Extracts 

As an alternative source novel enzymes, a preliminary screen of CAMRs 
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thermophile collection was undertaken. Enzymes from this source may 
have the advantage of greater stability, and therefore longevity of action. 
Strains were selected on the basis either of sensitivity to CB1 954, or those 
which are resistant but which impart a yellow/golden coloration to agar 
5 containing prodrug. 

Two of these strains (B. thermoflavus and B. licheniformis) generated the 
cytotoxic 4HX form and were selected for further study. 

10 Example 7 

Identification Of Further Nitroreductase Enzymes 

Having identified the two families of nitroreductase in Bacillus, a search 
was undertaken of both finished and unfinished genomes for homologues, 
using YwrO and YodC/NfnB. On the basis of this search homologues of 
YwrO were identified in the genomes of Yersinia pestis and Porphyromonas 
gingivalis, and homologues of NfnB in the genomes of Pyrococcus furiosus, 
Haemophilus influenza, Synechocystis PCC 6803, Campylobacter jejuni, 
Archaeglobus, Helicobacter pylori, Heliocbacter fulgidus and Thermus 
aquaticus. 

In addition to the above, two E.coli genes were found to be homologues of 
rat DTD and YwrO, and were designated Yher and YabF. They were 
discovered to share the characteristic of YwrO in that they lack the internal 
25 protein domain found in the rat DTD enzyme and functional mammalian 

homologues. 

(i) P. gingivalis YwrO homologue 

P. gingivalis YwrO homologue is a dimeric fiavoprotein with native molecular 
mass estimated by gel filtration at 40 kDa. Although it shares sequence 
homology with DTD and forms only the 4HX reduction product of CB1 954 



15 



20 
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(K m 1200//M, k caT 3.2), it differs from DTD in that it is active with SN23862 
and it can only use NADH as cofactor (cf DTD which can use either NADH 
or NADPH and is inactive with SN23862). It can reduce azodyes but it is 
inactive with menadione or flavins. 

(ii) C. jejuni NfnB homologue 

C. jejuni NfnB homologue produces only the 4HX reduction product of 
CB1954 (K m 143 /yM, k cat 11.2) using NADPH as cofactor and it is also 
active with SN23862. it can use the quinone menadione as substrate as 
well as azodyes and the flavins FMN and FAD. 

(iii) Archaeogiobus fuigidus NfnB homologue 

Archaeogfobus fuigidus NfnB homologue is a dimeric flavoprotein of 42 kDa 
native molecular mass, producing the 4HX derivative of CB1954 only (K m 
690 /vM, k cat 56.2) using NADPH as cofactor. It is also active with 
SN23862 and menadione (K m 9 //M), but does not decolourise azodyes and 
has only weak flavin reductase activity. 

(iv> /-/.influenzae and H. pylori NfnB homologues 

Both these enzymes are dimeric flavoproteins and form the 4HX reduction 
product of CB1954 using NADPH in preference to NADH, but have no 
activity with azodyes. The former also lacks activity with the quinone 
menadione and flavins FMN or FAD. Both however have weak activity 
with SN23862 and may be active with other prodrugs. 

(v) Y.pestis nfnB homologue and Synechocystis YwrO homologue 

Both these proteins reduce CB1954 but produce only the relatively non- 
toxic 2HX derivative using NADPH as cofactor. They do however show 
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activity with SN23862 and the former can also reduce azodyes. 



Example 8 

Comparison of Nitroreductase Sequences 

We compared the amino acid sequences of nitoreductases according to the 
invention with each other and with known rat, human and E.cofi 
sequences, and the results are illustrated in Figures 1 and 2. In Figure 1 , 
rat, mouse and two human sequences make up the first four lanes for 
comparison purposes. It is evident that nitroreductases of the invention are 
lacking a sequence from positions 51-82 of the rat sequence. 

In Figure 2, sequences of nitroreductases of the invention are compared 
with the known E.coii sequence, which is designated nfmB in the second- 
to-last lane. 

The invention thus provides nitroreductase enzymes, DNA and genes 
therefor and methods of obtaining such enzymes and of using the enzymes 
and DNA coding therefor in clinical applications. 



SUBSTITUTE SHEET (RULE 26) 



WO 00/47725 



PCT/GB00/00431 



-22 - 



ENZYME 


M.Wt 


CB1954 


SN23862 


ACTIVITY 


(kDa) 


Product 


Km 


Kin 


B. pumilis CP044 










Peak 1 


ND 


4HX 


v. low 


ND 


Peak 2 


ND 


4HX 


>1000 


ND 


Peak 3 


ND 


2/4HX 


999 


ND 


B.lautus CP060 










Peak 1 


35 


2HX 


211 


325 


Peak 2 


42 


4HX 


>2000 


none 


Peak 3 


47 


4HX 


257 


active 



Table 3: Fractionation of nitroreductase activity in cell extracts of Bacillus lautus and Bacillus pumilis 



STRAIN 


CB1954 


SN23862 


Product 


NADH 


NADPH 


NADH 


NADPH 


1078 


2/4HX 


13.8 


22.6 


8.5 


17.6 


2122 a 


2/4HX 


36.6 


56.0 


33.4 


62.8 


6012 b 


4>2HX 


15.2 


37.8 


8.2 


35.2 


6013 c 


2HX 


9.8 


49.4 


6.4 


39.0 


6031 d 


2HX 


11.9 


42.1 


8.2 


33.8 


6036 


2HX 


10.7 


26.7 


7.3 


26.2 


6044 


2HX 


4.0 


21.3 


4.5 


9.9 



Table 4: Characteristics of nitroreductase activity of thermophiles identified as being sensitive to CB1954 
[Identified as Bacillus thermoflavus a > Bacillus licheniformis b, Bacillus licheniformis c , Bacillus alkophilus d] 
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ENZYME 


M.Wt 


CB1954 


SN23862 


ACTIVITY 


(kDa) 


Product 


Km 


Km 


B. pumilis CP044 










Peak 1 


ND 


4HX 


v. low 


ND 


Peak 2 


ND 


4HX 


>1000 


ND 


Peak 3 


ND 


2/4HX 


999 


ND 


B.lautusCP06i) 










Peak 1 


35 


2HX 


211 


325 


Peak 2 


42 


4HX 


>2000 


none 


Peak 3 


47 


4HX 


257 


active 



Table 3: Fractionation of nitroreductase activity in cell extracts of Bacillus lautus and Bacillus pumilis 



STRAIN 


CB1954 


SN23862 


Product 


NADH 


NADPH 


NADH 


NADPH 


1078 


2/4HX 


13.8 


22.6 


8.5 


17.6 


2122 a 


2/4HX 


36.6 


56.0 


33.4 


62,8 


6012 b 


4>2HX 


15.2 


37.8 


8.2 


35.2 


6013 c 


2HX 


9.8 


49.4 


6.4 


39.0 


6031 <* 


2HX 


11.9 


42.1 


8.2 


33.8 


6036 


2HX 


10.7 


26.7 


7.3 


26.2 


6044 


2HX 


4.0 


21.3 


4.5 


9.9 



Table 4: Characteristics of nitroreductase activity of thermophiies identified as being sensitive to CB 1 954 
[Identified as Bacillus thermoflavus a , Bacillus licheniformis Bacillus licheniformis c , Bacillus alkophilus d] 
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CLAIMS 

1 . A nitroreductase characterised in that it preferentially reduces 
CB1 954 to a cytotoxic 4-hydroxylamine (4HX) derivative instead of a non- 

5 cytotoxic 2-hydroxylamine derivative. 

2. A nitroreductase according to Claim 1 further characterised in that 
it reduces CB1954 to the 4HX derivative with a K m of less than 700 
micromolar. 

10 

3. A nitroreductase according to Claim 1 or 2 further characterised in 
that it is NADPH dependant. 

4. A nitroreductase according to any of Claims 1 to 3, further 
15 characterised in that it reduces CB1954 to a cytotoxic 4-hydroxylamine 

(4HX) derivative substantially without producing the non-cytotoxic 2- 
hydroxyiamine derivative. 

5. A nitroreductase according to any of Claims 1 to 4 which reduces 
20 the prodrug to the toxic derivative with a Kcat of at least 8. 

6. A nitroreductase according to any of Claims 1 to 5, which reduces 
CB1954 or an analogue thereof to a toxic derivative, shares at least 50% 
sequence identity with the rat DTD sequence and does not contain a 

25 domain that is the same as or corresponds to amino acids 51 to 82 of the 

rat DTD sequence. 

7. A nitroreductase characterised in that it reduces a prodrug to a toxic 
derivative with a K m of less 700 micromolar, wherein the prodrug is 

30 selected from CB1954 and analogues thereof. 

8. A nitroreductase according to Claim 7 which reduces the prodrug to 

SUBSTITUTE SHEET (RULE 26) 
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the toxic derivative with a K m of less 300 micromolar. 

9. A nitroreductase according to Claim 7 or 8 which reduces the 
prodrug to the toxic derivative with a Kcat of at least 8. 

5 

10. A nitroreductase according to Claim 9 which reduces the prodrug to 
the toxic derivative with a Kcat of at least 10. 



11. A nitroreductase according to any of Claims 7 to 10, further 
10 characterised in that it reduces CB1954 to a toxic derivative, it reduces 

SN23862 to a toxic derivative, it can use both NADH and NADPH as 
electron donor and in that it shares no more than 30% sequence identity 
with the E.coff NfnB sequence. 

15 12. A nitroreductase according to any of Claims 7 to 1 1 further 

characterised in that it shares at least 50% sequence identity with the rat 
DTD sequence and in that it does not contain a domain that is the same as 
or corresponds to amino acids 51 to 82 of the rat DTD sequence. 

20 1 3. A nitroreductase characterised in that it reduces a prodrug to a toxic 

derivative with a Kcat of at least 8. 

1 4. A nitroreductase according to Claim 1 3, further characterised in that 
it reduces CB1954 to a toxic derivative, it reduces SN23862 to a toxic 
25 derivative, it can use both NADH and NADPH as electron donor and in that 

it shares no more than 30% sequence identity with the E.coli NfnB 
sequence. 

1 5. A nitroreductase according to Claim 1 3 or 1 4, further characterised 
30 in that it reduces CB1954 or an analogue thereof to a toxic derivative, in 

that it shares at least 50% sequence identity with the rat DTD sequence 
and in that it does not contain a domain that is the same as or corresponds 
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to amino acids 51 to 82 of the rat DTD sequence. 



1 6. A nitroreductase characterised in that it reduces CB1 954 to a toxic 
derivative, it reduces SN23862 to a toxic derivative, it can use both NADH 
5 and NADPH as electron donor and in that it shares no more than 30% 

sequence identity with the E.coli NfnB sequence. 

17. A nitroreductase according to Claim 16, wherein the sequence 
identity is about 25% or less. 

10 

18. A nitroreductase characterised in that it reduces CB1954 or an 
analogue thereof to a toxic derivative, in that it shares at least 50% 
sequence identity with the rat DTD sequence and in that it does not 
contain a domain that is the same as or corresponds to amino acids 51 to 

15 82 of the rat DTD sequence. 

1 9. Use of a DNA sequence coding for a nitroreductase according to any 
preceding Claim in manufacture of a medicament for prodrug therapy. 

20 20. A viral vector, comprising 

(a) a DNA encoding nitroreductase according to any of 
Claims 1 to 1 8 operatively coupled to 

(b) a promoter for expression of the DNA. 

25 21. A mini-gene comprising 

(a) a DNA encoding nitroreductase according to any of 
Claims 1 to 18 operatively coupled to 

(b) a promoter for expression of the DNA. 

30 22. A pharmaceutical composition comprising a nitroreductase according 

to any of Claims 1 to 1 8 in combination with a pharmaceutical^ acceptable 
carrier. 
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23. A pharmaceutical composition for use in a directed-enzyme prodrug 
therapy, comprising a pharmaceutically acceptable carrier and a compound 
for converting a prodrug into a drug, wherein a compound comprises a 
nitroreductase according to any of Claims 1 to 1 8 conjugated to a targeting 

5 moiety. 

24. A pharmaceutical composition according to Claim 23 wherein the 
targeting moiety comprises an antibody specific for a target cell. 

10 25. A pharmaceutical composition according to Claim 23 wherein the 

targeting moiety is a moiety preferentially accumulated by or taking up by 
a target cell. 

26. A method of preparing a nitroreductase, comprising expressing a 
15 gene in a bacterial cell, wherein the gene codes for a nitroreductase 

according to any of Claims 1 to 18. 

27. Use of a nitroreductase according to any of Claims 1-18 in 
manufacture of a medicament for anti-tumour therapy. 



20 



28. Use of a compound comprising a nitroreductase according to any of 
Claims 1 to 1 8 conjugated to a targeting moiety in manufacture of a 
medicament for anti-tumour therapy. 
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DTD-Like Proteins 



uru-M ra t NAD(PlH-cuinone oxi do reductase 1 . (brown rai): 
The aligned proteins are: NQOWat NAD{P) ™". NQQ1 hurnan , NAD(P)H-quinone 
NQ01 mouse, NAD(P)H-quinone oxidoreducxase 1 {mouse) JUUi "u ^ (numan}; yersinia, 
oxidoriductase 1 (human); NQ02_human, NAD(P)H^u,none oxj^edu^ 
un . named homo , 09 u e iYerstn/a pests); yheR.Ecoii yheR 
(Bac^s^); ywrO^amylo, ywrO 

ydeGsubtii, yceG (Bacillus subtiiis); Porpn_gmg, unearned nomoiogue v . 
and; yabFJEcoii, yabF [Escherichia coif) 
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ydgX-Bs 
yodC-B8 
Syaecliocyatia 

Taq 

Haam_inf 



ydgJ-Bs 
yodC-B8 
Synachocystia 

Taq 

Sal^typhim 
nfafl 

Haam_ia£ 



ydgl-BS 
yodC-3s 
Syaachocystis 
Taq 

of nB 



ydgl-Ba 

yodC-Bs QKQi 
Synachocyatis • a ^3 El ^ 



.20. 




afaB 



181 130. 



NfnB-Uke Proteins 

The aligned proteins are: ydgl-BS, ydgi {Bacillus subrilisY. yodCBs. yodC fflac//te subrilisY. 
SynechocysTis. orgA <Sy„ec/70cy<rt* PCC 6803); Taq, NOX_THETH (77)er/nw w»&c:«): 
Sal_TYphim. nfn8 [Salmonella ryphimurium); nfnb_entcf. nfnB {£nterooacrer cloacae)', nrnB, 
{Escherichia col/ B). and; Haernjnf, YC78_HA£1N [Haemophilus influenzae). 
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SEQUENCE LISTING 

<110> Microbiological Research Authority 

<120> Nitroreductase Enzymes 

<130> gws/21226-seq 

<140> 
<141> 

<160> 27 

<170> Patentln Ver . 2.1 

<210> 1 
<211> 525 
<212> DNA 

<213> Bacillus amyloliquef aciens 

<220> 

<221> CDS 

<222> (1) . . (525) 

<r400> 1 

gtg aaa gta ttg gta tta gcg gtt cac cct gac atg gag aac tea gcg 48 

Met Lvs Val Leu Val Leu Ala Val His Pro Asp Met Glu Asn Ser Ala 

1 5 10 15 

gtc aat aag gca tgg gca gaa gaa tta aaa aaa cat gat gaa etc acg 96 
Val Asn Lys Ala Trp Ala Glu Glu Leu Lys Lys His Asp Glu Leu Thr 
20 25 " 30 

gtc cgt gag ctt tat aaa gaa tat ccg gac ggg caa ate gat gcg gaa 144 
Val Arg Glu Leu Tyr Lys Glu Tyr Pro Asp Gly Gin lie Asp Ala Glu 
35 40 45 

aag gaa cgt cag ctg tgt gaa cag tat gac egg ate gta ttt caa ttt 192 
Lys Glu Arg Gin Leu Cys Glu Gin Tyr Asp Arg lie Val Phe Gin Phe 
50 55 "* 60 

ccg ctg tat tgg tac agt gcg cct ccg ctt tta aaa aca tgg atg gat 240 
Pro Leu Tyr Trp Tyr Ser Ala Pro Pro Leu Leu Lys Thr Trp Met Asp 
65 70 75 80 

cat gtg ctg teg tac ggc tgg gee tac ggc tec aaa gga aag gcg ctg 28 8 
His Val Leu Ser Tyr Gly Trp Ala Tyr Gly Ser Lys Gly Lys Ala Leu 
85 ~ 90 95 

cat ggc aaa gaa ttg atg ctg get gtt tec gta ggt gec gga gag gat 33 6 
His Gly Lys Glu Leu Met Leu Ala Val Ser Val Gly Ala Gly Glu Asp 
100 105 110 

gca tac cag gca gga ggg tea aac cac ttt aca ttg age gag ctg tta 3 84 
Ala Tyr Gin Ala Gly Gly Ser Asn His Phe Thr Leu Ser Glu Leu Leu 
115 120 125 

agg ccg ttt cag gca atg get aat ttt aca ggt atg acc tat teg ccg 4 32 
Arg Pro Phe Gin Ala Met Ala Asn Phe Thr Gly Met Thr Tyr Leu Pro 
130 135 '* 140 

get ttc gcg ctg tac ggt gta aat ggg gcg gat gcg acg gat att cat 4 80 
Ala Phe Ala Leu Tyr Gly Val Asn Gly Ala Asp Ala Thr Asp lie His 
145 150 155 160 

gac aat gec aaa cgt ctg get get tac ata aag aaa tea ttt taa 525 
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Asp Asn Ala Lys Arg Leu Ala Ala Tyr lie Lys Lvs Ser Phe 

165 170 175 



<210> 2 
<211> 175 
<212> PRT 

<213> Bacillus amyloliquef aciens 
<400> 2 

Val Lys Val Leu Val Leu Ala Val His Pro Asp Met Glu. Asn Ser Ala 
15 10 15 

Val Asn Lys Ala Trp Ala Glu Glu Leu Lys Lys His Asp Glu Leu Thr 
20 25 30 

Val Arg Glu Leu Tyr Lys Glu Tyr Pro Asp Gly Gin lie Asp Ala Glu 
35 . 40 45 

Lys Glu Arg Gin Leu Cys Glu Gin Tyr Asp Arg lie Val Phe Gin Phe 
50 ' 55 60 

Pro Leu Tvr Trp Tyr Ser Ala Pro Pro Leu Leu Lys Thr Trp Met Asp 
65 * ' 70 75 80 

His Val Leu Ser Tyr Gly Trp Ala Tyr Gly Ser Lys Gly Lys Ala Leu 
85 90 95 

His Gly Lys Glu Leu Met Leu Ala Val Ser Val Gly Ala Gly Glu Asp 
100 105 110 

Ala Tyr Gin Ala Gly Gly Ser Asn His Phe Thr Leu Ser Glu Leu Leu 
115 120 125 

Arg Pro Phe Gin Ala Met Ala Asn Phe Thr Gly Met Thr Tyr Leu Pro 
130 135 140 

Ala Phe Ala Leu Tyr Gly Val Asn Gly Ala Asp Ala Thr Asp lie His 
145 150 155 160 

Asp Asn Ala Lys Arg Leu Ala Ala Tyr lie Lys Lys Ser Phe 

165 170 175 



<210> 3 
<211> 528 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . <528) 

<400> 3 

atg aaa ata ttg gtt ttg gca gtg cat 
Met Lys lie Leu Val Leu Ala Val His 

1 5 

gtt aat aag gcg tgg get gag gaa ttg 
Val Asn Lys Ala Trp Ala Glu Glu Leu 
20 25 

gta egg gat ctt tat aag gaa tac ccg 
Val Arg Asp Leu Tyr Lys Glu Tyr Pro 
35 40 

aag gaa cag cag ctg tgc gag gaa tac 



cct cat atg gag acc tea gtt 48 

Pro His Met Glu Thr Ser Val 

10 15 

agt aaa cat gac aat ate aca 96 

Ser Lys His Asp Asn lie Thr 
30 

gat gaa gcg ata gat gtt gcg 144 

Asp Glu Ala He Asp Val Ala 
45 

gat egg att gtc ttt caa ttc 192 
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Lvs Glu Gin Gin Leu Cys Glu Glu Tyr Asp Arg lie Val Phe Gin Phe 
50 55 * 60 

ccg eta tat tgg tac age tct ccg ccg etc ttg aaa aaa tgg cag gat 240 
Pro Leu Tyr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Lys Trp Gin Asp 
65 70 75 80 

ctt gtg ctg act tau ggc tgg get ttt ggt tea gaa gga aat gee ttg 28 8 
Leu Val Leu Thr Tyr Gly Trp Ala Phe Gly Ser Glu Gly Asn Ala Leu 
85 " 90 95 

cat ggc aag gag ctg atg ctg get gta tea aca ggg age gaa gca gaa 33 6 
His Glv Lvs Glu Leu Met Leu Ala Val Ser Thr Gly Ser Glu Ala Glu 
100 105 110 

aaa tat caa gcg ggc gga gca aat cat tac teg ate agt gag eta ttg 3 84 
Lys Tyr Gin Ala Gly Gly Ala Asn His Tyr Ser lie Ser Glu Leu Leu 
115 120 125 

aaa cca ttt cag gee acg agt aat ctg ate ggc atg aag tat ctg cct 432 
Lys Pro Phe Gin Ala Thr Ser Asn Leu lie Gly Met Lys Tyr Leu Pro 
130 135 140 

cca tat gtg ttc tat ggc gtg aat tat gca get gca gag gat att tct 480 
Pro Tyr Val Phe Tyr Gly Val Asn Tvr Ala Ala Ala Glu Asp lie Ser 
145 150 155 160 

cac agt gca aaa egg tta gee gaa tac ate cag cag cct ttt gtt taa 52 8 
His Ser Ala Lys Arg Leu Ala Glu Tyr lie Gin Gin Pro Phe Val 
165 170 175 

<210> 4 
<211> 176 
<;212> PRT 

<213> Bacillus subtilis 
<400> 4 

Met Lys lie Leu Val Leu Ala Val His Pro His Met Glu Thr Ser Val 
15 10 15 

Val Asn Lys Ala Trp Ala Glu Glu Leu Ser Lys His Asp Asn lie Thr 
20 25 30 

Val Arg Asp Leu Tyr Lys Glu Tyr Pro Asp Glu Ala lie Asp Val Ala 
35 " ' 40 45 

Lys Glu Gin Gin Leu Cys Glu Glu Tyr Asp Arg lie Val Phe Gin Phe 
50 55 60 

Pro Leu Tyr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Lys Trp Gin Asp 
65 70 75 80 

Leu Val Leu Thr Tyr Gly Trp Ala Phe Gly Ser Glu Gly Asn Ala Leu 
85 * 90 95 

His Gly Lvs Glu Leu Met Leu Ala Val Ser Thr Gly Ser Glu Ala Glu 
100 105 110 

Lys Tyr Gin Ala Gly Gly Ala Asn His Tyr Ser lie Ser Glu Leu Leu 
115 120 125 

Lys Pro Phe Gin Ala Thr Ser Asn Leu lie Gly Met Lys Tyr Leu Pro 
130 13 5 - 14 0 

Pro Tyr Val Phe Tyr Gly Val Asn Tyr Ala Ala Ala Glu Asp lie Ser 
145 150 155 160 
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His Ser Ala Lys Arg Leu Ala Glu Tyr lie Gin Gin Pro Phe Val 
165 170 175 

<210> 5 
<211> 525 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (525) 

<400> 5 

atg aaa aca tta gtt ate gtt ata cat cct aat ttg gaa acg tct gtt 48 

Met Lys Thr Leu Val lie Val lie His Pro Asn Leu Glu Thr Ser Val 
15 10 15 

gtc aac aaa acc tgg atg aat cgt tta aag caa gag aaa gac att acg 96 
Val Asn Lys Thr Trp Met Asn Arg Leu Lys Gin Glu Lys Asp lie Thr 
20 25 30 

gtt cat gac ctg tac ggt gaa tac cct aat ttt ate att gat gta gaa 144 
Val His Ast) Leu Tyr Gly Glu Tyr Pro Asn Phe lie lie Asp Val Glu 
35 40 45 

aaa gag cag cag etc ctg tta gat cat gag cgt ate gtt ttt cag ttc 192 
Lys Glu Gin Gin Leu Leu Leu Asp His Glu Arg lie Val Phe Gin Phe 
50 55 60 

cca atg tat tgg tac age agt ccc gcg tta etc aaa caa tgg gaa gat 24 0 
Pro Met Tvr Trp Tyr Ser Ser Pro Ala Leu Leu Lys Gin Trp Glu Asp 
65 " 70 75 80 

gat gtg tta aca cat ggc tgg get tat gga act gga gga act aaa ttg 288 
Asp Val Leu Thr His Gly Trp Ala Tyr Gly Thr Gly Gly Thr Lys Leu 
85 90 95 

cat gga aaa gaa eta etc tta get ate tec tea ggc gca cag gaa tct 336 
His Gly Lys Glu Leu Leu Leu Ala lie Ser Ser Gly Ala Gin Glu Ser 
100 105 110 

gat tat caa gca ggc gga gaa tat aat ate acg ate age gag ctt ate 3 84 
Asp Tyr Gin Ala Gly Gly Glu Tyr Asn lie Thr lie Ser Glu Leu lie 
115 120 125 

aga ccg ttt caa gtc act get aac tat ata gga atg cgt ttt ctt cct 432 
Arg Pro Phe Gin Val Thr Ala Asn Tyr lie Gly Met Arg Phe Leu Pro 
130 135 140 

gcg ttt aca caa tat ggg aca ctt cat ctt tea aaa gaa gat gtt aag 480 
Ala Phe Thr Gin Tyr Gly Thr Leu His Leu Ser Lys Glu Asp Val Lys 
145 150 155 160 

aac agt gcg gag aga ttg gtt gac tat ctt aaa gee gag cat taa 525 
Asn Ser Ala Glu Arg Leu Val Asp Tyr Leu Lys Ala Glu His 

165 170 175 

<210> 6 
<211> 175 
<212> PRT 

<213> Bacillus subtilis 
<400> 6 

Met Lys Thr Leu Val lie Val He His Pro Asn Leu Glu Thr Ser Val 
15 10 15 
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Val Asn Lys Thr Trp Met Asn Arg Leu Lvs Gin Glu Lys Asp lie Thr 
20 25 30 

Val His Asp Leu Tyr Gly Glu Tyr Pro Asn Phe He He Asp Val Glu 
35 40 45 

Lys Glu Gin Gin Leu Leu Leu Asp His Glu Arg He Val Phe Gin Phe 
50 55 60 

Pro Met Tyr Trp Tyr Ser Ser Pro Ala Leu Leu Lys Gin Trp Glu Asp 
65 70 75 80 

Asp Val Leu Thr His Gly Trp Ala Tyr Gly Thr Gly Gly Thr Lys Leu 
85 90 95 

His Gly Lys Glu Leu Leu Leu Ala lie Ser Ser Gly Ala Gin Glu Ser 
100 105 110 

Asp Tyr Gin Ala Gly Gly Glu Tyr Asn He Thr He Ser Glu Leu He 
115 " 120 125 

Arg Pro Phe Gin Val Thr Ala Asn Tyr He Gly Met Arg Phe Leu Pro 
130 135 140 

Ala Phe Thr Gin Tyr Gly Thr Leu His Leu Ser Lys Glu Asp Val Lys 
145 150 155 160 

Asn Ser Ala Glu Arg Leu Val Asp Tyr Leu Lys Ala Glu His 

165 " 170 175 

<210> 7 
<211> 594 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (594) 



<400> 7 

atg gat cat atg aaa aca etc gta etc gtt gta cat ccg aat ata gaa 

Met Asp His Met Lys Thr Leu Val Leu Val Val His Pro Asn He Glu 
15 10 15 



48 



tec tct cgt ate aat aaa aag tgg aaa gaa gec gtt tta agt gaa cca 96 

Ser Ser Arg He Asn Lys Lys Trp Lys Glu Ala Val Leu Ser Glu Pro 

20 25 30 

gat gta act gtc cat gat ctt tat gaa aaa tat cgc gat caa cca att 144 

Asp Val Thr Val His Asp Leu Tyr Glu Lys Tyr Arg Asp Gin Pro He 
35 40 45 

gat gtg gaa ttt gaa caa cag cag etc ctg gee cat gac cgt ate gtt 192 

Asp Val Glu Phe Glu Gin Gin Gin Leu Leu Ala His Asp Arg He Val 
50 55 60 

ttt cag ttt cca tta tac tgg tac age age cca ccg ctt tta aaa cag 240 

Phe Gin Phe Pro Leu Tyr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Gin 

65 70 75 80 

tgg ttt gat gaa gtg ttt acg ttt ggc tgg get cat ggt ccc ggc gga 288 

Trp Phe Asp Glu Val Phe Thr Phe Gly Trp Ala His Gly Pro Gly Gly 

85 90 95 

aat aaa ttg aag ggg aaa gag tgg gta act gee atg tec ate ggt tea 336 

Asn Lys Leu Lys Gly Lys Glu Trp Val Thr Ala Met Ser He Gly Ser 
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100 105 11C 

cct gaa cac tct tat caa gec ggc gga tat aac ttg ttt teg ata age 3S4 
Pro Glu His Ser Tvr Gin Ala Glv Glv Tyr Asn Leu ?he Ser He Ser 
115 120 125 

gag ctg aca aaa ccg ttc caa gca tct gec cat tta gta ggc atg acc 432 
Glu Leu Thr Lys Pro Phe Gin Ala Ser Ala His Leu Val Gly Met Thr 
130 * 135 140 

tat ctg cct tec ttt gec gaa tat cgc gee aat aca ate agt gac caa 480 
Tyr Leu Pro Ser Phe Ala Glu Tyr Arg Ala Asn Thr He Ser Asp Gin 
145 150 155 160 

gaa att gee gaa agt gcg aat egg tat gta aag cat att aca aat ata 528 
Glu He Ala Glu Ser Ala Asn Arg Tyr Val Lys His He Thr Asn lie 
165 170 175 

gaa tta aac ccg aag gtt cgc ctg caa agg tat ttg aaa cag ctg gag 576 
Glu Leu Asn Pro Lys Val Arg Leu Gin Arg Tyr Leu Lys Gin Leu Glu 
180 1B5 190 

agt gtc gat tta aca taa 594 
Ser Val Asp Leu Thr 

195 



<210> 8 
<211> 198 
<212> PRT 

<213> Bacillus subtilis 
<400> 8 

Met Asp His Met Lys Thr Leu Val Leu Val Val His Pro Asn He Glu 
15 10 15 

Ser Ser Arg He Asn Lys Lys Trp Lys Glu Ala Val Leu Ser Glu Pro 

20 25 30 

Asp Val Thr Val His Asp Leu Tyr Glu Lys Tyr Arg Asp Gin Pro He 
35 40 45 

Asp Val Glu Phe Glu Gin Gin Gin Leu Leu Ala His Asp Arg He Val 
50 t 55 60 

Phe Gin Phe Pro Leu Tyr Trp Tyr Ser Ser Pro Pro Leu Leu Lys Gin 
65 70 75 80 

Trp Phe Asp Glu Val Phe Thr Phe Gly Trp Ala His Gly Pro Gly Gly 
85 90 95 

Asn Lys Leu Lys Gly Lys Glu Trp Val Thr Ala Met Ser He Gly Ser 
100 105 110 

Pro Glu His Ser Tyr Gin Ala Gly Gly Tyr Asn Leu Phe Ser He Ser 
115 " 120 125 

Glu Leu Thr Lys Pro Phe Gin Ala Ser Ala His Leu Val Gly Met Thr 
130 " 135 140 

Tyr Leu Pro Ser Phe Ala Glu Tyr Arg Ala Asn Thr He Ser Asp Gin 
145 150 155 160 

Glu He Ala Glu Ser Ala Asn Arg Tyr Val Lvs His He Thr Asn He 
165 170 175 

Glu Leu Asn Pro Lys Val Arg Leu Gin Arg Tyr Leu Lys Gin Leu Glu 
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180 



185 



190 



Ser Val Asp Leu Thr 





195 


<210> 


9 


c21I> 


630 


<212> 


DNA 


<213> 


Bacillus subtilis 


<220> 




<221> 


CDS 


<222> 


(1) . - (630) 


<400> 


9 


atg ate aaa aca aac gat 


Met He Lvs Thr Asn Asp 


1 


5 



48 



10 



15 



ate cgc aac tat gat ccg gca gta aaa ate age aaa gaa gaa atg aca 96 

lie Arg Asn Tyr Asp Pro Ala Val Lys He Ser Lys Glu Glu Met Thr 

20 25 3 0 

gag ate tta gag gaa gca aca act gee cca tct tct gtt aac gcg cag 144 

Glu He Leu Glu Glu Ala Thr Thr Ala Pro Ser Ser Val Asn Ala Gin 

35 40 45 

cca tgg cgt ttt ctt gtc att gac age ccg gaa gga aaa gaa aag etc 192 

Pro Trp Arg Phe Leu Val He Asp Ser Pro Glu Gly Lys Glu Lys Leu 

50 55 60 



gca ccg ctt gca age ttt aac caa aca caa gtc aca aca tea tct get 
Ala Pro Leu Ala Ser Phe Asn Gin Thr Gin Val Thr Thr Ser Ser Ala 
65 70 75 80 

gtc ate get gta ttt gca gac atg aac aac gca gac tat eta gaa gaa 
Val He Ala Val Phe Ala Asp Met Asn Asn Ala Asp Tyr Leu Glu Glu 
85 90 95 

ate tat tea aaa gec gtg gaa ctt ggt tac atg ccg cag gag gtc aaa 
He Tyr Ser Lys Ala Val Glu Leu Gly Tvr Met Pro Gin Glu Val Lys 
100 105 HO 

gac aga caa ate gee gcg ctg acc gca cat ttt gaa aag ctt ccg gca 
Asp Arg Gin He Ala Ala Leu Thr Ala His Phe Glu Lys Leu Pro Ala 
115 120 125 

cag gtc aac cgt gaa acg ate ctg att gac gga ggt ctt gtt tec atg 
Gin Val Asn Arg Glu Thr He Leu He Asp Gly Gly Leu Val Ser Met 
130 " 135 140 

cag ctg atg ctg act gca cgc gcg cat ggc tac gat aca aac ccg ate 
Gin Leu Met Leu Thr Ala Arg Ala His Gly Tyr Asp Thr Asn Pro He 
145 150 155 160 

ggc gga tac gat aaa gaa aac ate gcg gaa acc ttc gga tta gat aaa 
Gly Gly Tvr Asp Lys Glu Asn He Ala Glu Thr Phe Gly Leu Asp Lys 
165 170 175 



240 



288 



336 



384 



432 



480 



528 



gaa cgt tat gta ccg gtt atg eta ctt tct ate gga aaa gca gca gac 
Glu Arg Tvr Val Pro Val Met Leu Leu Ser He Gly Lys Ala Ala Asp 
180 185 190 

gaa ggc tat get tec tac cgt ctg ccg att gat aca att gca gaa tgg 
Glu Gly Tvr Ala Ser Tvr Arg Leu Pro He Asp Thr He Ala Glu Trp 
195 200 205 



576 



624 
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aaa taa 630 
Lys 

210 

<210> 10 
<211> 210 
<212> PRT 

<213> Bacillus subtilis 
<400> 10 

Met He Lys Thr Asn Asp Phe Met Glu He Met Lys Gly Arg Arg Ser 
1.5 10 15 

He Arg Asn Tyr Asp Pro Ala Val Lys He Ser Lys Glu Glu Met Thr 
20 25 30 

Glu He Leu Glu Glu Ala Thr Thr Ala Pro Ser Ser Val Asn Ala Gin 
35 40 45 

Pro Trp Arg Phe Leu Val lie Asp Ser Pro Glu Gly Lys Glu Lys Leu 
50 55 60 

Ala Pro Leu Ala Ser Phe Asn Gin Thr Gin Val Thr Thr Ser Ser Ala 
65 70 75 80 

Val He Ala Val Phe Ala Asp Met Asn Asn Ala Asp Tyr Leu Glu Glu 
85 90 95 

He Tyr Ser Lys Ala Val Glu Leu Gly Tyr Met Pro Gin Glu Val Lys 
100 105 HO 

Asp Arg Gin He Ala Ala Leu Thr Ala His Phe Glu Lys Leu Pro Ala 
115 120 125 

Gin Val Asn Arg Glu Thr He Leu lie Asp Gly Gly Leu Val Ser Met 
130 " 135 140 

Gin Leu Met Leu Thr Ala Arg Ala His Gly Tyr Asp Thr Asn Pro He 
145 150 155 160 

Gly Gly Tyr Asp Lys Glu Asn He Ala Glu Thr Phe Gly Leu Asp Lys 
165 170 175 

Glu Arg Tyr Val Pro Val Met Leu Leu Ser lie Gly Lys Ala Ala Asp 
180 185 190 

Glu Gly Tyr Ala Ser Tyr Arg Leu Pro lie Asp Thr lie Ala Glu Trp 
195 " 200 205 

Lys 

210 

<210> 11 

<211> 609 

<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (609) 

<400> 11 

atg acg aat act ctg gat gtt tta aaa gca cgt gca tct gta aag gaa 4 8 

Met Thr Asn Thr Leu Asp Val Leu Lys Ala Arg Ala Ser Val Lys Glu 
15 10 15 
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tat gat aca aat gcc ccg ate cct aag gag gag ctg act gag eta tta 96 

Tyr Asp Thr Asn Ala Pro lie Ser Lys Glu Glu Leu Thr Glu Leu Leu 
20 25 30 

gac ctt gcc act aaa gcg cct tct get tgg aac ctt cag cat tgg cat 144 
Asp Leu Ala Thr Lys Ala Pro Ser Ala Trp Asn Leu Gin His Trp His 
35 40 45 

ttt aca gta ttc cac age gat gaa tea aaa gcg gag ctt ctt cct gta 192 
Phe Thr Val Phe His Ser Asp Glu Ser Lvs Ala Glu Leu Leu Pro Val 
50 55 60 

gcg tat aat caa aaa caa ate gtt gag tct tct get gtt gtt gcc att 240 
Ala Tyr Asn Gin Lys Gin He Val Glu Ser Ser Ala Val Val Ala He 
65 70 75 80 

tta ggc gat tta aag gca aat gaa aac ggt gaa gaa gtt tat get gaa 2 88 
Leu Gly Asp Leu Lys Ala Asn Glu Asn Gly Glu Glu Val Tyr Ala Glu 
85 90 95 

tta gca age caa ggc tat att acg gat gaa ate aaa caa aca ttg etc 336 
Leu Ala Ser Gin Gly Tyr He Thr Asp Glu He Lys Gin Thr Leu Leu 
100 105 110 

ggc caa ate aac ggt get tac caa age gag caa ttc gca cgt gat tec 3 84 
Gly Gin He Asn Gly Ala Tyr Gin Ser Glu Gin Phe Ala Arg Asp Ser 
115 120 125 

get ttc tta aat get tct tta get get atg cag ctt atg att gcc gca 432 
Ala Phe Leu Asn Ala Ser Leu Ala Ala Met Gin Leu Met He Ala Ala 
130 135 140 

aaa gca aaa ggt tat gac act tgc gca ate ggc gga ttt aac aaa gag 480 
Lys Ala Lys Gly Tyr Asp Thr Cys Ala He Gly Gly Phe Asn Lys Glu 
145 ' . 150 155 160 

cag ttc caa aag caa ttt gat ate agt gag cgc tat gtt ccg gtt atg 528 
Gin Phe Gin Lys Gin Phe Asp He Ser Glu Arg Tyr Val Pro Val Met 
165 170 175 

ctt att tea ate ggc aaa gca gtg aag cct gcg cat caa age aac cgt 576 
Leu He Ser He Gly Lys Ala Val Lys Pro Ala His Gin Ser Asn Arg 
180 185 190 

ctg ccg ctt tea aaa gta tea act tgg ctg taa 609 
Leu Pro Leu Ser Lys Val Ser Thr Trp Leu 
195 "* 200 

<210> 12 
<211> 203 
<212> PRT 

<213> Bacillus subtilis 
<400> 12 

Met Thr Asn Thr Leu Asp Val Leu Lys Ala Arg Ala Ser Val Lys Glu 
15 10 15 

Tyr Asp Thr Asn Ala Pro He Ser Lys Glu Glu Leu Thr Glu Leu Leu 
20 25 30 

Asp Leu Ala Thr Lys Ala Pro Ser Ala Trp Asn Leu Gin His Trp His 
35 40 45 

Phe Thr Val Phe His Ser Asp Glu Ser Lys Ala Glu Leu Leu Pro Val 
50 55 60 
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Ala Tyr Asn Gin Lys Gin lie Val Glu Ser Ser Ala Val Val Ala lie 
65 70 75 80 

Leu Gly Asp Leu Lvs Ala Asn Glu Asn Gly Glu Glu Val Tyr Ala Glu 
85 90 95 

Leu Ala Ser Gin Gly Tvr lie Thr Asp Glu lie Lys Gin Thr Leu Leu 
100 ' 105 110 

Gly Gin lie Asn Gly Ala Tyr 'Gin Ser Glu Gin Phe Ala Arg Asp Ser 
115 120 125 

Ala Phe Leu Asn Ala Ser Leu Ala Ala Met Gin Leu Met lie Ala Ala 
130 135 140 

Lys Ala Lys Gly Tyr Asp Thr Cys Ala lie Gly Gly Phe Asn Lys Glu 
145 150 155 160 

Gin Phe Gin Lys Gin Phe Asp lie Ser Glu Arg Tyr Val Pro Val Met 
165 170 175 

Leu lie Ser lie Gly Lys Ala Val Lys Pro Ala His Gin Ser Asn Arg 
180 " 185 190 

Leu Pro Leu Ser Lys Val Ser Thr Trp Leu 
195 200 

<210> 13 
<211> 555 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (555) 

<400> 13 

atg atg tct cag cca gcg aaa gtt ttg ctg ctg tat gcc cat ccg gaa 4 8 

Met Met Ser Gin Pro Ala Lys Val Leu Leu Leu Tyr Ala His Pro Glu 

1 5 10 15 

tct cag gac teg gtg gca aac egg gta ctg ctt aaa ccg gcc acg cag 96 
Ser Gin Asp Ser Val Ala Asn Arg Val Leu Leu Lys Pro Ala Thr Gin 
20 25 30 

etc age aat gtt acc gtg cac gac ctt tac gcg cac tat ccc gat ttt 144 
Leu Ser Asn Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 40 45 

ttt att gat ate ccc cgt gag cag gca tta ctg cgc gag cac gag gtg 192 
Phe lie Asp He Pro Arg Glu Gin Ala Leu Leu Arg Glu His Glu Val 
50 55 60 

att gtc ttt cag cat cct ctt tat acc tat age tgc ccg gcg eta ctg 240 
He Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 
65 70 75 80 

aaa gag tgg ctg gac egg gta tta agt cgt ggt ttt gcc age ggg ccg 28 8 
Lys Glu Trp Leu Asp Arg Val Leu Ser Arg Gly Phe Ala Ser Gly Pro 
85 90 95 

gga gga aac caa ctg gcg gga aag tac tgg cgt age gtg att acc acc 33 6 
Gly Gly Asn Gin Leu Ala Gly Lys Tyr Trp Arg Ser Val He Thr Thr 
100 105 110 

ggc gag ccg gaa agt get tac cgt tat gac gcg ctg aat cgc tac ccg 3 84 
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Glv Glu Pro Glu Ser Ala Tyr Arg Tvr Asp Ala Leu Asn Arg Tyr Pro 
115 120 125 

atg age gat gcg ctg cgc ccc ttt gaa ctg gcg gcg ggc atg tgc. egg 432 
Met Ser Asp Val Leu Arg Pro Phe Glu Leu Ala Ala Glv Met Cys Arg 
130 135 140 

atg cat ugg tta agt ccc ate att att tac tgg gcg aga egg caa age 480 
Met His Trp Leu Ser Pro lie lie lie Tyr Trp Ala Arg Arg Gin Ser 
145 150 155 160 

gca cag gag ctg gcg age cac gee aga gee tac ggt gac tgg ctg gca 528 
Ala Gin Glu Leu Ala Ser His Ala Arg Ala Tyr Gly Asp Trp Leu Ala 
165 170 175 

aat ccg ctg tct cca gga ggc cgc tga 5 55 

Asn Pro Leu Ser Pro Gly Gly Arg 

180 165 

<210> 14 
<211> 185 
<212> PRT 

<213> Escherichia coli 
<400> 14 

Met Met Ser Gin Pro Ala Lys Val Leu Leu Leu Tyr Ala His Pro Glu 
15 10 15 

Ser Gin Asp Ser Val Ala Asn Arg Val Leu Leu Lys Pro Ala Thr Gin 
20 25 30 

Leu Ser Asn Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 40 45 

Phe lie Asp lie Pro Arg Glu Gin Ala Leu Leu Arg Glu His Glu Val 
50 55 60 

lie Val Phe Gin His Pz-o Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 
65 70 75 80 

Lvs Glu Trp Leu Asp Arg Val Leu Ser Arg Gly Phe Ala Ser Gly Pro 
85 90 95 

Gly Gly Asn Gin Leu Ala Gly Lys Tyr Trp Arg Ser Val lie Thr Thr 
100 105 110 

Gly Glu Pro Glu Ser Ala Tyr Arg Tyr Asp Ala Leu Asn Arg Tyr Pro 
115 120 * 125 

Met Ser Asp Val Leu Arg Pro Phe Glu Leu Ala Ala Gly Met Cys Arg 
130 135 140 

Met His Trp Leu Ser Pro lie lie lie Tyr Trp Ala Arg Arg Gin Ser 
145 150 155 ~ 160 

Ala Gin Glu Leu Ala Ser His Ala Arg Ala Tyr Gly Asp Trp Leu Ala 
165 170 " 175 

Asn Pro Leu Ser Pro Gly Gly Arg 

180 " " 185 

<210> 15 
<211> 531 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . (531) 

<400> 15 

atg att etc ata att tat gcg cat ccg tat ccg cat cat tec cat gcg 48 

Met lie Leu lie lie Tyr Ala His Pro Tyr Pro His His Ser His Ala 
15 10 15 

aat aaa egg atg ctt gaa cag gca agg acg ctg gaa ggc gtc gaa att 96 
Asn Lys Arg Met Leu Glu Gin Ala Arg Thr Leu Glu Gly Val Glu lie 
20 25 30 

cgc tct ctt tat caa etc tat cct gac ttc aat ate gat att gee gee 144 
Arg Ser Leu Tyr Gin Leu Tyr Pro Asp Phe Asn lie Asp lie Ala Ala 
35 40 45 

gag cag gag gcg ctg tct cgc gec gat ctg ate gtc tgg cag cat ccg 192 
Glu Gin Glu Ala Leu Ser Arg Ala Asp Leu lie Val Trp Gin His Pro 
50 55 60 

atg cag tgg tac age att cct ccg etc etc aaa ctt tgg ate gat aaa 240 
Met Gin Trp Tyr Ser lie Pro Pro Leu Leu Lys Leu Trp lie Asp Lys 
65 ~ " 70 75 80 

gtt ttc tec cac ggc tgg get tac ggt cat ggc ggc acg gcg ctg cat 288 
Val Phe Ser His Gly Trp Ala Tyr Gly His Gly Gly Thr Ala Leu His 
85 90 95 

ggc aaa cat ttg ctg tgg gcg gtg acg ace ggc ggc ggg gaa age cat 336 
Gly Lys His Leu Leu Trp Ala Val Thr Thr Gly Gly Gly Glu Ser His 
100 105 110 

ttt gaa att ggt gcg cat ccg ggc ttt gat gtg ctg teg cag ccg eta 3 84 
Phe Glu lie Gly Ala His Pro Gly Phe Asp Val Leu Ser Gin Pro Leu 
115 120 125 

cag gcg acg gca ate tac tgc ggg ctg aac tgg ctg cca ccg ttt gee 432 
Gin Ala Thr Ala lie Tyr Cys Gly Leu Asn Trp Leu Pro Pro Phe Ala 
130 135 140 

atg cac tgc ace ttt att tgt gac gac gaa ace etc gaa ggg cag gcg 480 
Met His Cvs Thr Phe lie Cys Asp Asp Glu Thr Leu Glu Gly Gin Ala 
145 " 150 155 160 

cgt cac tat aag caa cgt ctg ctg gaa tgg cag gag gee cat cat gga 52 8 
Arg His Tyr Lys Gin Arg Leu Leu Glu Trp Gin Glu Ala His His Gly 
165 170 175 

tag 531 



<210> 16 
<211> 177 
<212> PRT 

<213> Escherichia coli 
<400> 16 

Met lie Leu lie lie Tyr Ala His Pro Tyr Pro His His Ser His Ala 
15 10 15 

Asn Lys Arg Met Leu Glu Gin Ala Arg Thr Leu Glu Gly Val Glu lie 
20 25 30 

Arg Ser Leu Tyr Gin Leu Tyr Pro Asp Phe Asn lie Asp lie Ala Ala 
35 40 45 
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Glu Gin Glu Ala Leu Ser Arg Ala Asp Leu lie Val Trp Gin Kis Pro 
50 55 60 

Met Gin Trp Tyr Ser lie Pro Pro Leu Leu Lys Leu Trp lie Asp Lys 
65 70 75 80 

Val Phe Ser His Gly Tro Ala Tyr Gly His Gly Gly Thr Ala Leu His 
85 " 90 95 

Gly Lvs His Leu Leu Trp Ala Val Thr Thr Gly Gly Gly Glu Ser His 
100 105 110 

Phe Glu lie Gly Ala His Pro Glv Phe Asp Val Leu Ser Gin Pro Leu 
115 120 125 

Gin Ala Thr Ala lie Tyr Cys Gly Leu Asn Trp Leu Pro Pro Phe Ala 
130 135 140 

Met His Cvs Thr Phe lie Cys Asp Asp Glu Thr Leu Glu Gly Gin Ala 
145 ' 150 155 160 

Arg His Tyr Lys Gin Arg Leu Leu Glu Trp Gin Glu Ala His His 
165 170 175 

Gly 

<210> 17 
<211> 222 
<212> PRT 

<213> Haemophilus influenzae 
<400> 17 

Met Thr Gin Leu Thr Arg Glu Gin Val Leu Glu Leu Phe His Gin Arg 
15 10 15 

Ser Ser Thr Arg Tyr Tyr Asp Pro Thr Lys Lys lie Ser Asp Glu Asp 
20 25 30 

Phe Glu Cys lie Leu Glu Cys Gly Arg Leu Ser Pro Ser Ser Val Gly 
35 ~ 40 45 

Ser Glu Pro Trp Lys Phe Leu Val lie Gin Asn Lys Thr Leu Arg Glu 
50 55 60 

Lys Met Lys Pro Phe Ser Trp Gly Met lie Asn Gin Leu Asp Asn Cys 
65 70 75 80 

Ser His Leu Val Val lie Leu Ala Lys Lye Asn Ala Arg Tyr Asp Ser 
85 90 95 

Gin Gin Gin Ala Ala Leu Thr Lys Tyr Lys Ala Leu Gin Glu Glu Asp 
100 105 110 

Met Lys Leu Leu Glu Asn Asp Arg Thr Leu Phe Asp Trp Cys Ser Lys 
115 120 125 

Gin Thr Tyr lie Ala Leu Ala Asn Met Leu Thr Gly Ala Ser Ala Leu 
130 135 140 

Gly lie Asp Ser Cys Pro lie Glu Gly Phe His Tyr Asp Lys Met Asn 
145 150 155 160 

Glu Cys Leu Ala Glu Glu Gly Leu Phe Asp Pro Gin Glu Tyr Ala Val 
165 170 175 

Lys Ser Arg Lys Gly Leu Asp Glu Val Val Lys Trp Val Gly 
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180 185 190 

<210> 18 
<211> 207 
<212> PRT 

<213> Thermus aquaticus 
<400> 18 

Met Glu Ala Thr Leu Pro Val Leu Asp Ala Lys Thr Ala Ala Leu Lys 
15 10 15 

Arg Arg Ser lie Arg Arg Tyr Arg Lys Asp Pro Val Pro Glu Gly Leu 
20 25 3 0 

Leu Arg Glu lie Leu Glu Ala Ala Leu Arg Ala Pro Ser Ala Trp Asn 
35 40 45 

Leu Gin Pro Trp Arg lie Val Val Val Arg Asp Pro Ala Thr Lys Arg 
50 ~ 55 60 

Ala Leu Arg Glu Ala Ala Phe Gly Gin Ala His Val Glu Glu Ala Pro 
65 ~ 70 75 80 

Val Val Leu Val Leu Tyr Ala Asp Leu Glu Asp Ala Leu Ala His Leu 
85 90 95 

Gin Lys Gin Ala lie Gin Arg Ala Phe Ala Ala Met Gly Gin Glu Ala 
100 105 110 

Arg Lys Ala Trp Ala Ser Gly Gin Ser Tyr lie Leu Leu Gly Tyr Leu 
115 120 125 

Leu Leu Leu Leu Glu Ala Tyr Gly Leu Gly Ser Val Pro Met Leu Gly 
130 135 140 

Phe Asp Pro Glu Arg Val Arg Ala lie Leu Gly Leu Pro Ser Arg Ala 
145 150 155 160 

Ala lie Pro Ala Leu Val Ala Leu Gly Tyr Pro Ala Glu Glu Gly Tyr 
165 " 170 175 

Pro Ser His Arg Leu Pro Leu Glu Arg 0 Val Val Leu Trp Arg 
180 185 0 190 

<210> 19 
<211> 212 
<212> PRT 

<213> Synechocystis PCC6803 
<400> 19 

Met Asp Thr Phe Asp Ala lie Tyr Gin Arg Arg Ser Val Lys His Phe 
15 10 15 

Asp Pro Asp His Arg Leu Thr Ala Glu Glu Glu Arg Lys Leu His Glu 
20 25 ~ 30 

Ala Ala lie Gin Ala Pro Thr Ser Phe Asn lie Gin Leu Trp Arg Phe 
35 40 45 

Leu lie lie Arg Asp Pro Gin Leu Arg Gin Thr lie Arg Glu Lys Tyr 
50 55 60 

Gly Asn Gin Ala Gin Met Thr Asp Ala Ser Leu Leu lie Leu Val Ala 
65 70 ~ 75 80 
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Ala Asp Val Asn Ala Trp Asp Lys Asp Pro Ala Arg Tyr Trp Arg Asn 
85 90 95 

Phe Tyr Gly Gly Lys Pro Gin Leu Gin Arg Asp Glu Ala Gin Arg Ser 
100 105 110 

lie Gly Met Ala Met Gin Asn Leu Met Leu Ala Ala Lys Ala Met Gly 
115 120 125 

Tyr Asp Ser Cys Pro Met lie Gly Phe Asp Leu Gin Lys Val Ala Glu 
130 135 140 

Leu Val Lys Leu Pro Ala Asp Tyr Ala lie Gly Pro Met Val Ala lie 
145 150 155 160 

Gly Lys Arg Tfar Glu Asp Ala Pro Gly Lys Arg Arg Ser Asn Ser Pro 
165 ~ 170 175 

Cys Leu Ala lie 
180 



<210> 20 
<211> 172 
<212> PRT 

<213> Archaeoglobus fulgidus 
<400> 20 

Met Glu Cys Leu Asp Leu Leu Phe Arg Arg Val Ser lie Arg Lys Phe 
15 10 15 

Thr Gin Asp Asp Val Asp Asp Glu He Leu Met Lys He Leu Glu Ala 
20 25 30 

Gly Asn Ala Ala Pro Ser Ala Gly Asn Leu Gin Ala Arg Asp Phe Val 
35 40 45 

Val He Arg Asn Pro Glu Thr Lys Lys Arg Leu Ala Met Ala Ala Leu 
50 55 60 

Lys Gin Met Phe He Ala Glu Ala Pro Val Val He Val Val Cys Ala 
65 70 75 80 

Asn Tyr Pro Arg Ser Met Arg Val Tyr Gly Glu Arg Gly Arg Leu Tyr 
85 90 95 

Ala Glu Gin Asp Ala Thr Ala Ala He Glu Asn He Leu Leu Ala Val 
100 105 110 

Thr Ala Leu Asn Leu Gly Ala Val Trp Val Gly Ala Phe Asp Glu Glu 
115 120 125 

Gin Val Ser Glu lie Leu Glu Leu Pro Glu Tyr Val Arg Pro Met Ala 
130 135 140 

He He Pro He Gly His Pro Ala Glu Asn Pro Ser Pro Arg Asn Arg 
145 150 155 160 

Tyr Pro Val Ser Met Leu Thr His Phe Glu Lys Trp 
165 170 



<210> 21 

<211> 174 

<212> PRT 

<213> Archaeoglobus fulgidus 
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<400> 21 

Met Glu Glu Cys Leu Lys Met lie Tyr Thr Arg Arg Ser He Arg Val 
1 5 10 15 

Tyr Ser Asp Arg Gin He Ser Asp Glu Asp He Glu Lys lie Leu Lys 
20 25 30 

Ala Ala Met Leu Ala Pro Ser Ala Gly Asn Glu Gin Pro Trp His Phe 
35 40 45 

He Val Val Arg Asp Arg Glu Met Leu Lys Lys Met Ser Glu Ala Phe 
50 55 60 

Thr Phe Gly Gin Met Leu Pro Asn Ala Ser Ala Ala He Val Val Cys 
65 " 70 75 80 

Ala Asp Pro Lys Leu Ser Lys Tyr Pro Tyr Asp Met Trp Val Gin Asp 
85 90 95 

Cys Ser Ala Ala Thr Glu Asn He Leu Leu Ala Ala Arg Cys Leu Gly 
100 105 110 

He Gly Ser Val Trp Leu Gly Val Tyr Pro Arg Glu Glu Arg Met Lys 
115 120 125 

Ala Leu Arg Glu Leu Leu Gly He Pro Glu Asn He Val Val Phe Ser 
130 ~ 135 140 

Val Val Ser Leu Gly Tyr Pro Lys Asp Glu Lys Asp Phe Tyr Glu Ala 
145 * 150 155 160 

Asp Asp Arg Phe Asn Pro Asp Arg He His Arg Glu Lys Trp 
165 170 

<210> 22 
<211> 606 
<212> DNA 

<213> Campylobacter jejuni 

<220> 

<221> CDS 

<222> (1) . . (606) 

<400> 22 

atg aaa aaa gaa ctt gaa att ttt age aca aga tat tct tgt aga aat 48 

Met Lys Lys Glu Leu Glu He Phe Ser Thr Arg Tyr Ser Cys Arg Asn 
15 10 15 

ttt aaa aat gaa aaa etc aaa aaa gag gat tta aat tct att tta gaa 96 
Phe Lys Asn Glu Lys Leu Lys Lys Glu Asp Leu Asn Ser He Leu Glu 
20 25 30 

ata gca aga tta age ccc agt tec ttg gga ctg gaa cct tgg aaa ttt 144 
He Ala Arg Leu Ser Pro Ser Ser Leu Gly Leu Glu Pro Trp Lys Phe 
35 40 45 

ata gta gtg caa gat gag aaa aga aaa gaa gaa ctt tct aaa att tgc 192 
He Val Val Gin Asp Glu Lys Arg Lys Glu Glu Leu Ser Lys He Cys 
50 55 60 

aat caa caa aaa cat gta aaa gat tgt get gca tta att ata ate att 240 
Asn Gin Gin Lys His Val Lys Asp Cys Ala Ala Leu He He He He 
65 " 70 75 80 

tea aga ctt gat ttt ttg gat tat ttt gaa gaa aaa ctt aga aaa aga 288 
Ser Arg Leu Asp Phe Leu Asp Tyr Phe Glu Glu Lys Leu Arg Lys Arg 
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85 90 95 



gat atg agt gaa aca gaa atg caa aaa cgc tta gat act tat atg cct 
Asp Met Ser Glu Thr Glu Met Gin Lvs Arg Leu Asp Thx Tyr Met Pro 
100 105 110 



get tta gga tat tgc aac gat aaa aaa aat cct caa aaa aat cgt ttt 
Ala Leu Gly Tyr Cys Asn Asd Lys Lys Asn Pro Gin Lys Asn Arg Phe 
180 185 190 

agt ttt gat gaa gtt gta aaa ttt att taa 
Ser Phe Asp Glu Val Val Lys Phe lie 
195 200 

<210> 23 
<211> 202 
<212> PRT 

<213> Campylobacter jejuni 
<400> 23 

Met Lys Lys Glu Leu Glu lie Phe Ser Thr Arg Tyr Ser Cys Arg Asn 
15 10 15 

Phe Lys Asn Glu Lys Leu Lys Lys Glu Asp Leu Asn Ser lie Leu Glu 
20 25 30 

lie Ala Arg Leu Ser Pro Ser Ser Leu Gly Leu Glu Pro Trp Lys Phe 
35 40 45 

lie Val Val Gin Asp Glu Lys Arg Lys Glu Glu Leu Ser Lys He Cys 
50 " 55 60 

Asn Gin Gin Lys His Val Lys Asp Cys Ala Ala Leu He He He He 
65 70 75 80 

Ser Arg Leu Asp Phe Leu Asp Tyr Phe Glu Glu Lys Leu Arg Lys Arg 
85 90 95 

Asp Met Ser Glu Thr Glu Met Gin Lys Arg Leu Asp Thr Tyr Met Pro 
100 105 110 

Phe Leu Lys Ser Leu Asn Gin Glu Gin Lys He Ser Tyr Ala Arg Glu 
115 120 125 

Gin Ala His He Ala Leu Ala Ser He Leu Tyr Ser Ala Asn Ala Leu 
130 135 140 

Asn He Ala Ser Cys Thr He Gly Gly Phe Asp Lys Glu Lys Leu Asp 
145 150 155 160 



336 



ttt tta aaa tct eta aat caa gaa caa aaa ata tct tat gca aga gaa 3 84 

Phe Leu Lys Ser Leu Asn Gin Glu Gin Lys He Ser Tyr Ala Arg Glu 
115 120 125 

caa get cat ata get eta get age ata ctt tac agt get aat get tta 432 

Gin Ala His He Ala Leu Ala Ser He Leu Tyr Ser Ala Asn Ala Leu 

130 135 140 

aat ata gca age tgc act ata ggt ggt ttt gat aaa gaa aag ctt gat 480 

Asn lie Ala Ser Cys Thr He Gly Gly Phe Asp Lys Glu Lys Leu Asp 
145 150 155 160 

tct tat tta tea ctt gat att caa aaa gaa aga tea agt ttg gtg gtg 528 

Ser Tyr Leu Ser Leu Asp He Gin Lys Glu Arg Ser Ser Leu Val Val 
165 170 175 



576 



606 
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Ser Tyr Leu Ser Leu Asp lie Gin Lys Glu Arg Ser Ser Leu Val Val 
165 170 175 

Ala Leu Gly Tyr Cys Asn Asp Lys Lys Asn Pro Gin Lys Asn Arg Phe 

180 185 190 

Ser Phe Asp Glu Val Val Lys Phe lie 

195 200 



<210> 24 

<211> 522 

<212> DNA 

<213> Porphyromonas gingival is 

<220> 

<221> CDS 

<222> (1) . . (522) 

<400> 24 

atg aaa aaa acg etc gta ata gtc gtt cac ccc gat ttg acc aaa tec 

Met Lys Lys Thr Leu Val He Val Val His Pro Asp Leu Thr Lys Ser 
1 5 10 15 

gtt ate aac aag get tgg gee aaa gee ate gaa ggt gca gee act ate 
Val He Asn Lys Ala Trp Ala Lys Ala He Glu Gly Ala Ala Thr He 
20 25 30 

cac cat etc tac gaa cag tat ccg aac gga caa ate gat eta gca cat 
His His Leu Tyr Glu Gin Tyr Pro Asn Gly Gin He Asp Leu Ala His 
35 "* 40 45 

gaa caa gee ctg ctg gag get cat gac cgc ate gtc ttc caa ttc ccc 
Glu Gin Ala Leu Leu Glu Ala His Asp Arg lie Val Phe Gin Phe Pro 
50 55 60 

etc tat tgg tat gca get ccc tat ctg ctg aag aag tgg atg gac gag 
Leu Tyr Trr> Tyr Ala Ala Pro Tyr Leu Leu Lys Lys Trp Met Asp Glu 
65 70 75 B0 



ttt gee gaa gga gca cag caa tgc cac acg ctg cga age tac ttg aat 
Phe Ala Glu Gly Ala Gin Gin Cys His Thr Leu Arg Ser Tyr Leu Asn 
115 120 125 



cat gec tgc tac gat tec tac aat cct cgc ctg ccg gaa atg ctg ccg 

His Ala Cvs Tyr Asp Ser Tyr Asn Pro Arg Leu Pro Glu Met Leu Pro 
145 * 150 155 160 

gec aac tgc gaa gee tat etc cgc ttt ate aaa gga gaa tga 

Ala Asn Cys Glu Ala Tyr Leu Arg Phe He Lys Gly Glu 
165 170 



48 



96 



144 



192 



240 



gtc ttt act gag ggc tgg gee tat ggt gee ggt gga gac aag atg gag 288 

Val Phe Thr Glu Gly Trp Ala Tyr Gly Ala Gly Gly Asp Lys Met Glu 
85 90 95 

ggt aaa gaa ate tgt gca gca gtc tec tgc gga tea ccc aaa tea get 33 6 

Gly Lys Glu He Cys Ala Ala Val Ser Cys Gly Ser Pro Lys Ser Ala 
100 105 HO 



384 



gta ttc gac ggg ata get get ttc ctg cgc get cga ttc acc ggc tac 432 
Val Phe Asp Gly He Ala Ala Phe Leu Arg Ala Arg Phe Thr Gly Tyr 
130 135 140 



480 



522 



<210> 25 
<211> 174 
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<212> PRT 

<213> Porphyromonas gingival is 
<400> 25 

Met Lys Lys Thr Leu Val lie Val Val His Pro Asp Leu Thr Lys Ser 
15 10 15 

Val lie Asn Lys Ala Trp Ala Lys Ala lie Glu Gly Ala Ala Thr lie 
20 25 30 

His His Leu Tyr Glu Gin Tyr Pro Asn Gly Gin lie Asp Leu Ala His 
35 40 45 

Glu Gin Ala Leu Leu Glu Ala His Asp Arg lie Val Phe Gin Phe Pro 
50 55 60 

Leu Tyr Trp Tyr Ala Ala Pro Tyr Leu Leu Lys Lys Trp Met Asp Glu 
65 7 0 75 80 

Val Phe Thr Glu Gly Trp Ala Tyr Gly Ala Gly Gly Asp Lys Met Glu 
85 90 95 

Gly Lys Glu lie Cys Ala Ala Val Ser Cys Gly Ser Pro Lys Ser Ala 
100 105 110 

Phe Ala Glu Gly Ala Gin Gin Cys His Thr Leu Arg Ser Tyr Leu Asn 
115 120 125 

Val Phe Asp Gly lie Ala Ala Phe Leu Arg Ala Arg Phe Thr Gly Tyr 
130 135 140 

His Ala Cys Tyr Asp Ser Tyr Asn Pro Arg Leu Pro Glu Met Leu Pro 
145 " 150 " 155 160 

Ala Asn Cys Glu Ala Tyr Leu Arg Phe lie Lys Gly Glu 
165 170 

<210> 26 
<211> 552 
<212> DNA 

<213> Yersinia pestis 

<220> 

<221> CDS 

<222> (1) . . (552) 

<400> 26 

atg atg ttg cag ccg ccg aag gtt ttg ctg ctg tat gcc cat ccg gaa 48 

Met Met Leu Gin Pro Pro Lys Val Leu Leu Leu Tyr Ala His Pro Glu 
15 10 15 

tea cag gac teg gtc get aac egg gtt tta ctg caa ccg gta cag cag 96 
Ser Gin Asp Ser Val Ala Asn Arg Val Leu Leu Gin Pro Val Gin Gin 
20 25 30 

tta gaa cat gtc act gtg cac gat ctt tat gca cat tat ccg gat ttc 144 
Leu Glu His Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 40 45 

ttt att gat att cat cat gag cag caa ttg eta cgt gat cat caa gtt 192 
Phe lie Asp lie His His Glu Gin Gin Leu Leu Arg Asp His Gin Val 
50 55 60 

att gta ttt caa cat cct tta tat act tac agt tgc cct gca tta ctg 240 
lie Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 
65 70 75 80 
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aaa gag tgg ttg gat egg gta ctg gca cgt ggt ttc gec aat ggc get 288 

Lys Glu Trp Leu Asp Arg Val Leu Ala Arg Glv Phe Ala Asn Gly Val 

85 90 95 

ggc ggc cat gca ctg acg gga aag cac tgg cgc teg gtg att acc acc 3 36 
Gly Gly His Ala Leu Thar Gly Lys His Trp Arg Ser Val lie Thr Thr 
LOO 105 110 

ggt gag cag gag gga act tac cgt att ggg gga tat aac cgt tac cca 384 
Gly Glu Gin Glu Gly Thr Tyr Arg lie Gly Gly Tyr Asn Arg Tyr Pro 
115 120 125 

atg gaa gac att ctg cgt cct ttc gaa ttg acg gcg get atg tgc cat 4 32 
Met Glu Asp lie Leu Arg Pro Phe Glu Leu Thr Ala Ala Met Cys His 
130 135 140 

atg cat tgg att aat ccg atg att att tac tgg gec aga cgc caa aag 480 
Met His Trp lie Asn Pro Met He He Tyr Trp Ala Arg Arg Gin Lys 
145 ISO 155 160 

ccg gaa aca etc gec agt cac gca caa get tat gtg caa tgg ctg cag 52 8 
Pro Glu Thr Leu Ala Ser His Ala Gin Ala Tyr Val Gin Trp Leu Gin 
165 170 175 

tea ccg etc acg aga gga etc tga 552 
Ser Pro Leu Thr Arg Gly Leu 
180 



<210> 27 
<211> 184 
<212> PRT 

<213> Yersinia pestis 
<400> 27 

Met Met Leu Gin Pro Pro Lys Val Leu Leu Leu Tyr Ala His Pro Glu 
1-5 10 15 

Ser Gin Asp Ser Val Ala Asn Arg Val Leu Leu Gin Pro Val Gin Gin 
20 25 30 

Leu Glu His Val Thr Val His Asp Leu Tyr Ala His Tyr Pro Asp Phe 
35 40 45 

Phe He Asp He His His Glu Gin Gin Leu Leu Arg Asp His Gin Val 
50 55 60 

He Val Phe Gin His Pro Leu Tyr Thr Tyr Ser Cys Pro Ala Leu Leu 
65 70 75 80 

Lys Glu Trp Leu Asp Arg Val Leu Ala Arg Gly Phe Ala Asn Gly Val 
85 ~ 90 95 

Gly Gly His Ala Leu Thr Gly Lys His Trp Arg Ser Val He Thr Thr 
100 105 110 

Gly Glu Gin Glu Gly Thr Tyr Arg He Gly Gly Tyr Asn Arg Tyr Pro 
115 120 125 

Met Glu Asp He Leu Arg Pro Phe Glu Leu Thr Ala Ala Met Cys His 
130 135 140 

Met His Trp He Asn Pro Met He He Tyr Trp Ala Arg Arg Gin Lys 
145 150 155 160 

Pro Glu Thr Leu Ala Ser His Ala Gin Ala Tyr Val Gin Trp Leu Gin 
165 170 175 
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Ser Pro Leu Thr Arg Gly Leu 
180 



<210> 28 

<211> 633 

<212> DNA 

<213> Helicobacter pylori 

<400> 28 



atgaaatttt 


tggatcaaga 


aaaaagaaga 


caattgctaa 


acgagcgcca 


ttcttgcaag 


60 


atgttcgaca 


gccattatga 


gttttctagt 


gaagaattag 


aagaaatcgc 


tgaaatcgct 


120 


aggctatcgc 


caagctctta 


caacacgcag 


ccatggcatt 


ttgtgatggt 


tactaataag 


180 


gatttaaaaa 


aacaaattgc 


agcgcacagc 


tattttaatg 


aagaaatgat 


taaaagcgct 


240 


tcagcgttaa 


tggtggtatg 


ctctttaaaa 


cccagcgagt 


tgttacccac 


tggccactac 


300 


atgcaaaacc 


tttacccgga 


gtcttataag 


gttagagtga 


tcccctcttt 


tgctcaaatg 


360 


cttggcgtga 


gattcaacca 


cagcatgcaa 


aaattagaaa 


gctatatttt 


ggagcaatgc 


420 


tatatcgctg 


tggggcaaat 


ttgcatgggc 


gtgagcttaa 


tgggattgga 


tagttgcatt 


480 


attggaggct 


ttgatccttt 


aaaagtgggc 


gaagttttag 


aagagcgtat 


caataaacct 


540 


aaaatcgcat 


gcttgatcgc 


tttgggcaag 


agggtggcag 


aagcgagcca 


aaaatcaaga 


600 


aaatcaaaag 


ttgatgccat 


tacttggttg 


tga 






633 



<210> 29 
<211> 210 
<212> PRT 

<213> Helicobacter pylori 
<400> 29 

Met Lys Phe Leu Asp Gin Glu Lys Arg Arg Gin Leu Leu Asn Glu Arg 
15 10 15 

His Ser Cys Lys Met Phe Asp Ser His Tyr Glu Phe Ser Ser Glu Glu 
20 25 30 

Leu Glu Glu lie Ala Glu lie Ala Arg Leu Ser Pro Ser Ser Tyr Asn 
35 40 45 

Thr Gin Pro Trp His Phe Val Met Val Thr Asn Lys Asp Leu Lys Lys 
50 55 60 

Gin lie Ala Ala His Ser Tyr Phe Asn Glu Glu Met He Lys Ser Ala 
65 70 75 80 

Ser Ala Leu Met Val Val Cys Ser Leu Lys Pro Ser Glu Leu Leu Pro 
85 90 95 

Thr Gly His Tyr Met Gin Asn Leu Tyr Pro Glu Ser Tyr Lys Val Arg 
100 105 110 

Val lie Pro Ser Phe Ala Gin Met Leu Gly Val Arg Phe Asn His Ser 
115 120 125 

Met Gin Lys Leu Glu Ser Tyr He Leu Glu Gin Cys Tyr lie Ala Val 
130 135 140 

Gly Gin He Cys Met Gly Val Ser Leu Met Gly Leu Asp Ser Cys He 
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145 

He Gly Gly Phe 



He Asn Lys Pro 
180 

Ala Glu Ala Ser 
195 

Trp Leu 

210 



150 

Asp Pro Leu Lys 
165 

Lys He Ala Cys 

Gin Lys Ser Arg 
200 



155 

Val Gly Glu Val 
170 

Leu He Ala Leu 
185 

Lys Ser Lys Val 



160 

Leu Glu Glu Arg 
175 

Gly Lys Arg Val 
190 

Asp Ala He Thr 
205 
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